Gene Tpet_1239 details


Gene Information

Locus tag: Tpet_1239
Symbol:
ID: 5170821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermotoga petrophila RKU-1
Kingdom: Bacteria
Replicon accession: NC_009486
Strand:
Start bp: 1253510
End bp: 1254559
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 47%
IMG OID: 640563763
Product: ApbE family lipoprotein
Protein accession: YP_001244829
Protein GI: 148270369
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
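
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1253510, 1254559)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```

Both values match the Gene Length and Protein Length fields of the record.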


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGCCGT CAGATAGCAA CGATCATCGG GTAACCAGAA GAAACGTCAT CATCTTTTCT 
TCACTTCTTC TAGGTTCTCT TGCGATTCTC TTGGCTTTAC TCCTCATCCG AACGAAAGAT
CAGTATTATG AGCTCAGAGA TTTCGCTCTC GGAACGAGTG TGAGGATAGT CGTTTCCTCT
CAGAAGATAA ATCCCAGAAC GATCGCAGAA GCCATTCTGG AAGACATGAA GAGGATTACC
TACAAGTTTT CTTTCACGGA TGAAAGAAGT GTTGTGAAAA AGATAAACGA TCATCCCAAC
GAATGGGTCG AGGTGGACGA AGAGACTTAC AGTTTGATCA AAGCGGCCTG CGCGTTCGCA
GAGCTCACAG ATGGAGCGTT TGATCCGACA GTAGGAAGGC TTCTCGAACT CTGGGGGTTT
ACCGGAAACT ACGAAAATCT CAGGGTACCT TCTCGAGAAG AGATCGAAGA AGCTCTGAAG
CATATCGGAT ATAAAAACGT TCTCTTCGAC GATAAGAACA TGAGAGTGAT GGTTAAAAAC
GGTGTGAAGA TCGATCTTGG TGGTATAGCG AAAGGGTACG CCCTTGACAG AACTAGGCAG
ATAGCACTCT CTTTTGACGA GAACGCAACG GGGTTTGTCG AAGCAGGTGG GGATATTCGT
ATCATCGGGC CAAAATTTGG AAAGTATCCG TGGGTGATAG GAGTAAAAGA TCCCAGGGAA
GACAACGTGA TAGATTACAT CTATCTGAAA TCCGGAGCGG TTGCGACTTC CGGTGATTAC
GAAAGATATT TCGTTGTGGA CGGTGTCAGG TATCATCATA TTCTCGATCC TTCAACGGGG
TATCCTGCTC GTGGTGTGTG GAGCGTAACG ATCGTAGCCG AAGATGCCAC CACAGCCGAC
GCACTCTCCA CAGCGGGCTT TGTGATGGCC GGAAAAGACT GGAGGAAGGT GGTGCTCGAT
TTTCCAAATA TGGGAGCTCA TCCGCTGATA GTTCTTGAAG GAGGAACGAT CGAAAAGTCT
GAGACCTTCA AGCTGTTCGA AAGAGAGTGA
 
Protein sequence
MWPSDSNDHR VTRRNVIIFS SLLLGSLAIL LALLLIRTKD QYYELRDFAL GTSVRIVVSS 
QKINPRTIAE AILEDMKRIT YKFSFTDERS VVKKINDHPN EWVEVDEETY SLIKAACAFA
ELTDGAFDPT VGRLLELWGF TGNYENLRVP SREEIEEALK HIGYKNVLFD DKNMRVMVKN
GVKIDLGGIA KGYALDRTRQ IALSFDENAT GFVEAGGDIR IIGPKFGKYP WVIGVKDPRE
DNVIDYIYLK SGAVATSGDY ERYFVVDGVR YHHILDPSTG YPARGVWSVT IVAEDATTAD
ALSTAGFVMA GKDWRKVVLD FPNMGAHPLI VLEGGTIEKS ETFKLFERE
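
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the pipeline that produced this record. It builds the codon table by hand, relying on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits. The function name `translate` and its behaviour of stopping at the first stop codon are choices made for this illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated blocks of 10 bases
    used in the listing above can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGTGGCCGT CAGATAGCAA CGATCATCGG GTAACCAGAA GAAACGTCAT CATCTTTTCT"
print(translate(first_line))  # MWPSDSNDHRVTRRNVIIFS
```

The output matches the first 20 residues of the protein sequence listed above. Pasting the complete gene sequence into `translate` is the natural next step for checking the whole record.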