Gene Tpet_1786 details

Gene Information

Locus tag: Tpet_1786
Symbol:
ID: 5170129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1796435
End bp: 1797940
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 42%
IMG OID: 640564307
Product: DNA mismatch repair protein MutS domain-containing protein
Protein accession: YP_001245362
Protein GI: 148270902
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
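
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1796435
END_BP = 1797940
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1797940 - 1796435 + 1 gives 1506 bp, and 1506 / 3 = 502 codons, one of which is the stop codon, leaving 501 residues.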


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00151585
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGTAC TTTTGATGTA CCCAGACAAG GATTTCAATT TGAAAAGAGA GTTGCCTTTC 
AATGCAGATG ATTTGACAAG AGATCTCGGT TTAGATGTGA TATTCGATCA CATGGCAAAG
GGAGATGGTT ACCTGTATAG TGTTGTGAGA AATGTCATTC TGAATCCAGA AACCGATCTG
GAGACGATAA AGTACCGCCA GGAAATTCTC AAAGATTGTA TGAAAAACCA GAACGTCGTA
AGGCGGTTGT TTCAGATACC GCTGGAGGTT CAAGAAAACA AGAAGAAAAA TTGGTGGGGG
GTTTTTGGAT GGAAAACTCC TATCAATGTC CTGAACGGCT CCAGAAAAGC TTTAGAAGCC
ATGCTGGTAG CGCTCAGAGA GCTTAAAAAG TTGGCAGATG AACATCGTCA CAATTTTCAT
TCTCGAGGCT TCACAAGGTT TTTCGAGATG ATAAGGACGG AACTGGATGA AGCCTACTTA
CAAACTGTGG AGAAGCATCT CATTAATTTG AGATTTTCAA ACGGTATGTT GTTCAAAGTA
AAACTCGGAA AAGGTAACGA AGGAAAAGAT TACACCCTCT GCCAGCCTGA CTCTTCAAGA
AGCATCCTGA AAAGACTACT CAGTGTGAGA CGAATGTATT CTTATAAATT ACATCCAAGA
GATGAAAGTG GAGCACGAGC ACTGGAGAAG TTGACCAATT TGGGACTTCG CAGGGTAGCA
GCCACAGTTT ACTACGCGGC AGAGCATGTA GAAAAATTTT TGAACAAAAT TCGAGAAGAA
CTGGCTTTCT ATATTGGTTG TCTCAACTTG CTGGAAGATG TAGAAAAATC GAAGATAAGT
TTTCCCGATC CCAAACCAAT TGATGAGGAC GATGTAACCG CCTTCAGAGG GCTTTACGAT
CTTAGTTTGC TTCTAATAAA GAGAAATAGA GTGATAAGTA ACGATCTGAA CACGCGTGGT
AAAAGAGTTT TTTTCATCAT GGGAGCGAAT CGCGGTGGGA AGACCACTTT CCTGAGAAGT
ATAGGGCAAG CTCAGCTGAT GATGCAAGCT GGTATGTTCG TTCCGGCGTC GTACTTTGAA
TCGAACGTTT GCAAAGGAAT TTTTACGCAC TTTAAAAGAG AGGAAGATCC AAGCTTGAAG
AGAGGAAAAT TTGAAGAAGA ACTTGTCAGA ATGAATGAGA TAGTCCTTCA TCTGCATAGA
AGGTCTATGG TGCTATTCAA CGAGTCCTTC TCATCTACGA ATGAAATGGA GGGTTCCGAA
GTGGCCTACC AGATTGTTCG AGCTCTACTG GACAGCCGTG TTAAAGTTTT TTACGTAACA
CACGTGTACG AACTGGCCCG CCGTTTTACA GGAGATGAAA GAGTGATGTT TCTACAAGCA
GAAAGGAAAC CTACCGGTGA AAGAACCTTC AAGATCAAAG AAGGCCTGCC TTCGCAGACA
AGTCATGCGA AGGATATATA CCTCAAAGTG TTCAGATCAT CAACTTCAAC TTCACCATCC
TCTTGA
 
Protein sequence
MRVLLMYPDK DFNLKRELPF NADDLTRDLG LDVIFDHMAK GDGYLYSVVR NVILNPETDL 
ETIKYRQEIL KDCMKNQNVV RRLFQIPLEV QENKKKNWWG VFGWKTPINV LNGSRKALEA
MLVALRELKK LADEHRHNFH SRGFTRFFEM IRTELDEAYL QTVEKHLINL RFSNGMLFKV
KLGKGNEGKD YTLCQPDSSR SILKRLLSVR RMYSYKLHPR DESGARALEK LTNLGLRRVA
ATVYYAAEHV EKFLNKIREE LAFYIGCLNL LEDVEKSKIS FPDPKPIDED DVTAFRGLYD
LSLLLIKRNR VISNDLNTRG KRVFFIMGAN RGGKTTFLRS IGQAQLMMQA GMFVPASYFE
SNVCKGIFTH FKREEDPSLK RGKFEEELVR MNEIVLHLHR RSMVLFNESF SSTNEMEGSE
VAYQIVRALL DSRVKVFYVT HVYELARRFT GDERVMFLQA ERKPTGERTF KIKEGLPSQT
SHAKDIYLKV FRSSTSTSPS S
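
The sequence blocks above are printed in groups of ten characters separated by spaces, so they need cleaning before any computation. The following is a minimal sketch of how one might strip that formatting, compute GC content, and translate the gene sequence for comparison with the protein sequence. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for elongation; the table's alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
# Translation table 11 uses these same assignments for elongation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as printed in the record.
    fragment = clean_sequence("ATGAGAGTAC TTTTGATGTA CCCAGACAAG")
    print(translate(fragment))
```

Pasting the full gene sequence block into `clean_sequence` and passing the result to `translate` and `gc_content` would allow the Protein Length and GC content fields to be cross-checked against the sequence itself.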