Gene Tpet_1493 details


Gene Information

Locus tag: Tpet_1493
Symbol:
ID: 5171161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1479095
End bp: 1481368
Gene Length: 2274 bp
Protein Length: 757 aa
Translation table: 11
GC content: 50%
IMG OID: 640564019
Product: MutS2 family protein
Protein accession: YP_001245077
Protein GI: 148270617
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
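
The start, end, and length fields above agree with one another, which a short sketch can confirm. It assumes (the record does not say so) that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative.

```python
# Values copied from the record above.
start_bp = 1479095
end_bp = 1481368


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2274
print(protein_length(length_bp))  # 757
```

Both results match the gene length (2274 bp) and protein length (757 aa) given in the record.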


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATTATC TCGAATCACT CGATTTTCCA AAAGTTGTGG AGATAGTAAA GAAATACGCG 
CTCTCTGACC TGGGAAGAAA ACATCTGGAC ACTCTCAAAC CGACGGTGAA TCCGTGGGAC
GAGCTCGAAC TCGTGGAGGA GCTTCTGAAC TATTTTGCCA GGTGGGGAGA GCCTCCCATA
AAGGGATTGA ACGATATCTC TCAGGAAGTG GAGAGGGTGA AGTCCGGTTC AGCCTTGGAA
CCATGGGAAC TTCTTCGCGT TTCCGTGTTT CTCGAAGGCT GTGACATCCT GAAGAAGGAC
TTCGAAAAGC GTGAATACAG CAGGCTCAAA GAGACCTTCT CCAGACTCAG CTCCTTCAGA
GATTTCGTGG AAGAGGTGAA CAGGTGCATA GAACAGGATG GAGAGATCTC GGACCGTGCG
AGTCCCAGAT TGAGGGAGAT CAGAACGGAA AAGAAGCGTC TTTCCAGCGA GATAAAGAGA
AAGGCCGATG ATTTCGTCAG GACGCACTCT CAGATCCTTC AGGAACAGAT GTACGTTTAC
AGGGATGGAA GGTATCTCTT CCCCGTGAAG GCTTCCATGA GGAACGCAGT GAGGGGAATC
GTTCACCATC TCTCCTCTTC CGGTGCCACC GTCTTTCTGG AGCCCGACGA GTTCGTCGAA
CTGAACAACA GAGTGCGTCT TTTAGAAGAG GAGGAAAGGC TGGAGATCAG CAGGATCCTG
AGACAGCTGA CGAACATACT CCTTTCCAGG CTCAACGACC TTGAGAGGAA CGTGGAACTC
ATAGCGCGTT TCGACTCCCT CTACGCGAGG GTGAAGTTCG CAAGAGAGTT CAACGGAACC
GTCGTGAAAC CTTCTTCGCG GATAAGACTT GTCAACGCAA GACATCCGTT GATACCAAAG
GAGAGAGTCG TTCCAATAAA TCTTGAGCTT CCACCCAACA AAAGAGGTTT CATCATAACG
GGTCCAAACA TGGGCGGAAA GACCGTGACT GTGAAAACCG TTGGTCTTTT TACCGCCCTC
ATGATGAGTG GCTTCCCTCT CCCCTGTGAT GAAGGAACGG AGCTGAAGGT CTTTCCAAAG
ATCATGGCGG ACATCGGTGA GGAGCAGAGC ATAGAACAGA GCCTCAGCAC CTTCTCGTCG
CACATGAAGA AGATCGTGGA GATCGTGAAG AATGCAGACA GCGATTCACT GGTCATCCTC
GACGAGCTGG GCTCGGGGAC GGATCCGGTG GAAGGAGCCG CCCTTGCCGT CGCGATAATA
GAGGATCTTC TGGAAAAAGG AGCAACGATC TTTGTAACCA CACACCTCAC ACCTGTGAAG
GTCTTCGCCA TGAACCATCC TCTGCTCTTG AACGCCTCCA TGGAGTTCGA TCCGGAAACC
CTCTCACCAA CTTACAGGGT TCTGGTTGGA GTTCCAGGAG GTTCTCACGC TTTCCAGATA
GCAGAAAAGT TGGGACTCGA CAAACGTATA ATCGAAAACG CCAGATCGAG ACTCTCTCGG
GAGGAGATGG AACTCGAGGG ACTCATAAGG TCTCTCCACG AGAAGATCTC GCTTCTTGAA
GAGGAGAAGA GAAAACTCCA GAAAGAACGC GAAGAGTACA TGAAACTGAG GGAGAAGTAC
GAGGAAGATT ACAAAAAGCT GAGGAGGATG AAGATAGAAG AGTTCGACAA AGAGCTGAGG
GAGCTCAACG ATTACATCAG AAAGGTCAAG AAGGAACTCG ATCAGGCGAT ACACGTGGCA
AAAACTGGCA GCGTTGACGA GATGAGAGAA GCGGTGAAGA CGATAGAGAA AGAGAAGAAA
GATCTGGAGC AAAAGAGAAT CGAAGAAGCG ACCGAAGAAG AAATAAAACC CGGAGATCAC
GTGAAAATGG AAGGTGGAAC CTCTGTGGGG AAGGTCGTTG AGGTGAAAAG TGGCACCGCC
CTTGTTGACT TTGGCTTTCT CAGATTGAAG GTGCCCGTTT CGAAACTGAA AAAGGCTAAA
AAAGAGGAGA AGGAAGAATC TTCAGCGGTC TCTTACAGGC CTTCGAGCTT CAGAACGGAA
ATAGACATAA GGGGTATGAC GGTTGAAGAA GCGGAGCCGG TTGTGAAGAA GTTCATCGAT
GACCTGATGA TGAACGGCAT CAGCAAGGGA TACATAATAC ACGGAAAGGG CACCGGAAAG
CTCGCATCTG GAGTATGGGA AATACTGAGA AAGGACAAAA GAGTGGTTTC TTTCAGATTC
GGAACACCTT CTGAGGGGGG AACGGGTGTC ACGGTGGTGG AGGTGAAAGT GTGA
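
The GC content reported in the Gene Information section is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name `gc_percent` is illustrative, and how the source database rounds the value is not stated in this record. The example uses only the first line of the sequence, so its result will differ from the whole-gene figure.

```python
def gc_percent(dna):
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied with its 10-base spacing.
first_line = "GTGGATTATC TCGAATCACT CGATTTTCCA AAAGTTGTGG AGATAGTAAA GAAATACGCG"
print(round(gc_percent(first_line), 1))
```

Passing the full sequence as a single string, with spaces and line breaks left in, gives the whole-gene value to compare against the record.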
 
Protein sequence
MDYLESLDFP KVVEIVKKYA LSDLGRKHLD TLKPTVNPWD ELELVEELLN YFARWGEPPI 
KGLNDISQEV ERVKSGSALE PWELLRVSVF LEGCDILKKD FEKREYSRLK ETFSRLSSFR
DFVEEVNRCI EQDGEISDRA SPRLREIRTE KKRLSSEIKR KADDFVRTHS QILQEQMYVY
RDGRYLFPVK ASMRNAVRGI VHHLSSSGAT VFLEPDEFVE LNNRVRLLEE EERLEISRIL
RQLTNILLSR LNDLERNVEL IARFDSLYAR VKFAREFNGT VVKPSSRIRL VNARHPLIPK
ERVVPINLEL PPNKRGFIIT GPNMGGKTVT VKTVGLFTAL MMSGFPLPCD EGTELKVFPK
IMADIGEEQS IEQSLSTFSS HMKKIVEIVK NADSDSLVIL DELGSGTDPV EGAALAVAII
EDLLEKGATI FVTTHLTPVK VFAMNHPLLL NASMEFDPET LSPTYRVLVG VPGGSHAFQI
AEKLGLDKRI IENARSRLSR EEMELEGLIR SLHEKISLLE EEKRKLQKER EEYMKLREKY
EEDYKKLRRM KIEEFDKELR ELNDYIRKVK KELDQAIHVA KTGSVDEMRE AVKTIEKEKK
DLEQKRIEEA TEEEIKPGDH VKMEGGTSVG KVVEVKSGTA LVDFGFLRLK VPVSKLKKAK
KEEKEESSAV SYRPSSFRTE IDIRGMTVEE AEPVVKKFID DLMMNGISKG YIIHGKGTGK
LASGVWEILR KDKRVVSFRF GTPSEGGTGV TVVEVKV
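
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first line of the gene sequence. It assumes the codon assignments and alternative start codons of NCBI translation table 11 as I understand them (GTG is among the start codons and is read as M in the initiator position). The names `translate`, `CODON_TABLE`, and `START_CODONS` are illustrative and do not come from the source database.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumed start codons for table 11; each is read as M when it begins the gene.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "GTGGATTATC TCGAATCACT CGATTTTCCA AAAGTTGTGG AGATAGTAAA GAAATACGCG"
print(translate(first_line))  # MDYLESLDFPKVVEIVKKYA
```

The output matches the first 20 residues of the protein sequence above. The leading GTG codon appears as M rather than V because it sits in the initiator position.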