Gene Nther_1813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1813 
Symbol 
ID6317138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1883987 
End bp1886377 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content37% 
IMG OID642644190 
ProductMutS2 family protein 
Protein accessionYP_001917973 
Protein GI188586428 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.553679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.572275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGTTTA AAAAATCTAT GGATACTTTG GAATTACCTA AAATAATAGA CCAGTTGAAG 
AAAGAAACTG TTTCAACTAT GACTAAGGAA ATTTGTGACG ACCTGGACCC TAGTGTAAAT
TATAATGAGA TAAAGACCTG GTTAAAGGAG ACAAGTGAAG CTAAAGAACT ATTAGCAGAA
CGAGACATCT CTCTTAGAGG GTTGCGGGAT ATCAGAAAGC AACTTCAGTT AGCTGCTAAA
GATGGCACCT TACAGGGGCC CGAACTCTTT CAAATATCTG AAATTATTGG TGTGTCTAAT
AGAGTAAGAA AAATAACTGA TGATAATTTT CAAGCTAATT ATCCGATTTT ATCTTCATTA
ATATCTAAAT TGCCTGAATT AAACCATCTT AAAAAAGAAC TTGATGATAA AATTGATGAA
AATGGTGAAG TAAAAGATTC AGCTAGTGTC AACTTGAGAA ATATCAGGCA AAAAATTAAA
AAGCTTCAGT CCCAGGTTAA AACCAGCGTT AACCGGATAC TGCAAAGTGG GGAAAAATAC
CTTCAAGATA AAATTGTTAC TATGAGATAC GATAGGTACG TGGTACCTGT TAAAGCGGAA
TATCAAAATA TGGTACCAGG AATTATTCAT GATCAATCAT CTAGTGGGAT GACTGTATAT
ATAGAACCCA AAGAAGTAGT TGAAAAAAAT AATGAACTAA GGCAGGCAAA GCGCGAAGAA
CATAGTGAAC TCGAAAAAAT ATTACAGGGG TTAAGTCAGA AAATAAAAGG ATATCATTAT
CAGCTTCATG ATTCATTACA AATTCTTGTT GAGTTAGATT TTATTTTGGC TAAAGGTTCG
TTATCTCGGC GTATGAACGC CCGAGAAGCT GAACTTAATC AAGAAAAACG ATTGGAGATT
ATTAAAGGAA AACACCCTTT ATTGGGAGAA GATGCCATAC CTGTAGATGT AAAATTAGGC
GATGAATTTA ATACTATGGT GATCACCGGT CCAAATACAG GTGGTAAGAC GGTTAGTTTA
AAGATGGTAG GTCTGTTTAC TTTAATGACC CAATCTGGAC TTCATATTCC AGCAGAACGC
GGTACGGAAA TGGGTGTTTT TGAACAAGTC TTTGCTGATA TTGGAGACGA ACAGGATATT
GAGCAATCTC TTAGTACATT TAGTTCTCAT ATGTCCAATA TTGTAAAAAT AGTTGACCAC
GCAAATAGTG AGTCTTTGAT ACTTTTAGAT GAGTTAGGTG CAGGCACTGA CCCAACAGAA
GGGTCGGCCT TGGCTATGTC ATTGTTAGAG CATTTTCATA ATTTGGGTTG TCGAAGCATA
GCTACTACAC ATTATACTCA ATTAAAAAGT TTTGCTCATG CCCGAGAAGG TGTGGAAAAT
GCTTCGGTAG AGTTTGACGA AGAAACTTTG GAACCGACTT ATAATTTGTT AATAGGAGTG
CCAGGCAAAA GTAATGCTTT CGTAATTTCA AGAAGACTGG GATTAAGTGA CAAAATTATA
AGCAATGCTA AAAGCTTTTT AGCAGACGAA GAGATTGAAG TGGAAGAACT TATCACTTCT
CTAACAGAAA AAGAAAAGTC AAGCCAGAAA ATGAAAGAGG AATTAGAACG GGAGCGGGCT
AAAGTCGAGC AAGTAAAAGC CCAGCTAGAA CAAGAACGAA AAGAAATTTC CAGAAAAAAA
GATGAAGTTT TGCAAAAAGC CAGAAGACAA GCTGAGGAAA TTATTTCTGA TGCTAAAAGA
GATGCTGAAG AGTCTCTCAA AGAAGCTAGA AAAATAGCTG AGAAAAAGTC CCATAAAGAA
ATGGCGGAAG TTAGCTCTAA AGTTAGAGAT AAACTATCGG GACATCAACA AAAGTTACGA
GAAGAGTTGA TGGATTCGGC AGACTCGGTA CCTTTATCAC CTGAAAAGTT AAAGCCTGGA
TTGACGGTAT ATATTTCTAA TCTTGATAAA GAGGGTCAAA TCTTACAAGT TAATCATGAT
AAAGGTGAAG CAGAGGTTCA GGTAGGGATA ATGAAAGTAA ACGTAAATTT TTCAGATATA
TTTCCTTCTG AAGAAGAAAA ATCTGGCTCA ACCTTTTCCG GGAATGTTAA CTCTTCCTCC
TCTTCAACTG GTAGAGGCAA TGTTTTTGCC GGTAAAAAGG AAAGAATTTC AACTGAATTA
GATATTAGAG GAGAGCGGGT TGAAGAAGCC ATAAATCAGG TTGATAAATA TCTTGATGAT
GCTTTAGTTG CTGGATTGGC CGAGATTAGA ATAATCCATG GTAAAGGAAC TGGTAACTTA
AGAAAAGGAA TCCAATTTCA TCTAGAAGGC CACCCAATGG TTTCTCAGTA CAGATTGGGA
AACAGACAAG AAGGGGGAGA AGGAGTGACT GTGGTCAAGT TAAACAATTG A
 
Protein sequence
MTFKKSMDTL ELPKIIDQLK KETVSTMTKE ICDDLDPSVN YNEIKTWLKE TSEAKELLAE 
RDISLRGLRD IRKQLQLAAK DGTLQGPELF QISEIIGVSN RVRKITDDNF QANYPILSSL
ISKLPELNHL KKELDDKIDE NGEVKDSASV NLRNIRQKIK KLQSQVKTSV NRILQSGEKY
LQDKIVTMRY DRYVVPVKAE YQNMVPGIIH DQSSSGMTVY IEPKEVVEKN NELRQAKREE
HSELEKILQG LSQKIKGYHY QLHDSLQILV ELDFILAKGS LSRRMNAREA ELNQEKRLEI
IKGKHPLLGE DAIPVDVKLG DEFNTMVITG PNTGGKTVSL KMVGLFTLMT QSGLHIPAER
GTEMGVFEQV FADIGDEQDI EQSLSTFSSH MSNIVKIVDH ANSESLILLD ELGAGTDPTE
GSALAMSLLE HFHNLGCRSI ATTHYTQLKS FAHAREGVEN ASVEFDEETL EPTYNLLIGV
PGKSNAFVIS RRLGLSDKII SNAKSFLADE EIEVEELITS LTEKEKSSQK MKEELERERA
KVEQVKAQLE QERKEISRKK DEVLQKARRQ AEEIISDAKR DAEESLKEAR KIAEKKSHKE
MAEVSSKVRD KLSGHQQKLR EELMDSADSV PLSPEKLKPG LTVYISNLDK EGQILQVNHD
KGEAEVQVGI MKVNVNFSDI FPSEEEKSGS TFSGNVNSSS SSTGRGNVFA GKKERISTEL
DIRGERVEEA INQVDKYLDD ALVAGLAEIR IIHGKGTGNL RKGIQFHLEG HPMVSQYRLG
NRQEGGEGVT VVKLNN