Gene Nther_2141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2141 
Symbol 
ID6314800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2263527 
End bp2265401 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content35% 
IMG OID642644528 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_001918295 
Protein GI188586750 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCTA ATTACAGCAG TGTCAATAAC AGATCCGAAG CCAATAGTAT TAAAAGTAGA 
TACAAAGCAC GTAAAGAACA ATTCGCAAGT GAATCAAAAG CACTTTCTAA CAAAATGAAT
CTAATATCTA ATTTAAGGTT ACTAACCGTT GTTTTAGGTA TTGGGTTAAC TGGCTATTTG
TTTTATATTG GGAATTATGT AGTCAGTTTT TTCTCTCTAA TCATTTTTAC AACAATTTTC
ATATTTCTGG TCATCAGATA TCGTGTATTA ACAAATAAGC GAAATTATGC CTGGGCCCTT
GCCGATATTA ATGAGCGTTC TCTTATCAGA TTAAAGGGGA GTTGGAATAG TTTTGAGGAT
ACTGGTAATG AATTTTTAGA CCATGATCAT CCCTATGCAG AAGATCTAGA TATCTTCGGA
CAAAATTCCC TTTTCCAGTG GATAAATACC ACTACTACCT TTTCGGGAAG GCATAAATTA
AAGACCATGT TAACACAACC ATGTAATAAG ATAGAAGAGA TACAAGCAAG GCAAAGGTCC
ATTCAGGAAT TAGCCGAGAA ACTCCATTGG CGTCAACATT TAGAAGCTTT AGGGAAACAA
CCCGAATACG AACACCGCAA TAACAAAGAA CTCCAAGTTG ATCCCCTTAA TTTAATATCA
TGGGCAAAAA CACAAAACCC CTTTTATTTA AAAACTTGGC TTAAGGTGAT CGTGAATCTA
CTACCCTTAA TAACTCTAAG TTTTATAATC GCAGCAATTA TGACTGGAGT CACTTATATT
TTTCCTATAA TTATGATTGC ACTACACATA TTGATCCTTA CTTATGATTA CTCTAATCGC
ATTAGCGAGT TCAGCTTGAT CTCAGGCTTT AATGAAAAAT TGTCGGCTTA CAAAGATATT
TTAACAGCCA TAGAAACAGA ACAATTTCAT GGAAGTATGT TAACAAAACT ACAGGATACA
ATATTAAATT CCAAAACGGG GACAAATGCT TCGGATCGTC TAAAGCTCCT TGACGAGATA
ATGGAATTCA TTTCGCATCG CTCAGGCCAG TTTTATATTA TATTTAATAT CTTATTTCTA
TTGGATTATC GATGGCAAAT TTCTTTAGAA CATTGGAAAC ATCAATCAGG AGATGAGCTT
GAACAATGGT TTGATATTTT GGGAGATTTT GAAGCCCTGA GTAGTTTAGC GATAATTCCA
TGTGATCATC CTGATTGGGC TCAACCTGAG ATCACAGCAG AACCTGGTCT ATTTCAGGCC
GAACAAATAG GCCACCCTTT ATTAACAGAA CATCGAGTTT GCAACGATAT TGATATGGGT
TCAAGTACAA ATAGTTTACT AATTACGGGT TCCAATATGT CTGGTAAAAG TACTTTACTC
AGAACAGCTG GAATCAATCT GGTACTAGCA TATTTGGGTG CTCCCGTTTG TGCAAATACT
ATGCAGGCGT CATTAATGAA AATTTATACA TGTATGAGAG TTAGCGATAA TTTAGAAAAG
AATTTATCTT CTTTTTATGC CGAACTACTT AGAATTAAAC ATATAGTTAA AAGTGCTGAA
CAGATACCAG TTTTTTATTT ATTAGACGAG ATTTTTAAAG GGACTAATTC TCGTGACCGA
CATACTGGTG CTAGAGCTGT AATTAAAAAA TTACAATCAG AAGGTGCCCT GGGCCTTGTT
TCAACACACG ATCTAGAATT AGGTGCCCTG GAGAGTCAAA ACACAAGTAT AAAAAATTAC
CATTTCAGAG AGTACTATCA AAATGGTGAA ATTTACTTTG ATTATATTTT GAGACCTGGG
TTGGCTCCTA CTACTAATGC TATTTATCTA ATGAAAATGG CAGGCATCGA TCCAGATGAA
GAAGATTTGG GTTAA
 
Protein sequence
MSANYSSVNN RSEANSIKSR YKARKEQFAS ESKALSNKMN LISNLRLLTV VLGIGLTGYL 
FYIGNYVVSF FSLIIFTTIF IFLVIRYRVL TNKRNYAWAL ADINERSLIR LKGSWNSFED
TGNEFLDHDH PYAEDLDIFG QNSLFQWINT TTTFSGRHKL KTMLTQPCNK IEEIQARQRS
IQELAEKLHW RQHLEALGKQ PEYEHRNNKE LQVDPLNLIS WAKTQNPFYL KTWLKVIVNL
LPLITLSFII AAIMTGVTYI FPIIMIALHI LILTYDYSNR ISEFSLISGF NEKLSAYKDI
LTAIETEQFH GSMLTKLQDT ILNSKTGTNA SDRLKLLDEI MEFISHRSGQ FYIIFNILFL
LDYRWQISLE HWKHQSGDEL EQWFDILGDF EALSSLAIIP CDHPDWAQPE ITAEPGLFQA
EQIGHPLLTE HRVCNDIDMG SSTNSLLITG SNMSGKSTLL RTAGINLVLA YLGAPVCANT
MQASLMKIYT CMRVSDNLEK NLSSFYAELL RIKHIVKSAE QIPVFYLLDE IFKGTNSRDR
HTGARAVIKK LQSEGALGLV STHDLELGAL ESQNTSIKNY HFREYYQNGE IYFDYILRPG
LAPTTNAIYL MKMAGIDPDE EDLG