Gene Tfu_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_1038 
Symbol 
ID3579668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp1216067 
End bp1217335 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID637684733 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_289099 
Protein GI72161442 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.16797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCT CGCTCGACTC CACCACCGTC GACGCGCGTG CGGACGGCGC GGTCGGGACA 
GCCGCGCAGG AGCTGGGCGA GCTGTGTGTG CTCCTGAAGC TCGGGGAGAT CGTGCTCAAG
GGCAGCAACC GGCACCTGTT CGAGCGCCGT CTGCAGAACA ACATCCGCGC CGCCGCTAAA
GGCCTGCCCG AATTCCGGCT CTCCCAGCGC AAAGGCATCA TCATGCTGCG CATGCCCGGT
GCCTCCGACC TCGAAGTGGC CCAGCTGGCC GAGCGCATGC GCAACGTCAT GGGGATCGTG
TGGGTGCACC TGGTGCGCCG GGTCGCCAAA GACGTGGACA CCGTCACCCA GGTCGCGGTG
CAGGCCATGG CCGACCGCTC CGGCTCTTTC GCGGTGCGGG CCCGCCGCCG GGACAAGCGT
TTCCCCATGA TCTCCTCGGA GCTGGCCGGG CACGTGGGCG CCGCGATCAA ACGCGCCTAC
CCCCACCTGA CCGTCAACCT CTCCCAGCCC GACCACACGG TCCAGATCGA GGTCGACAAA
GACGAGGTGT TCGTCTTCAC CGACGGCATC CCCGGCCAGG GCGGCCTCCC GGTAGGCATG
AGTGGACGCG GCCTGGTCCT GCTGTCCGGC GGGATCGACT CCCCGGTCGC GGCCTACCGC
ATGATGCGCC GCGGGCTCCG GCTCGACTTC CTGCACTTCT CCGGCATGCC GTTCACCGGA
CCGGAGTCCA TCTACAAGGC GTACAGCCTG GTCCGGCAGC TCGACCGCTT CCAGAACGGC
TCCCGGCTGT ACGTGGTGCC GTTCGGCAAG GCCCAGCAGC AGCTCCGCAA CTCCGGCGCG
GAACGGCTGC AGATCGTCGC CCAGCGGCGC CTCATGCTCA AAACCGCGGA AGCCCTTGCC
GACCGGCTCA ACGCCAGCGC GCTGGTCACC GGGGACGCCC TCGGCCAAGT GTCCAGCCAG
ACCCTGGCCA ACATCACCGC CCTGGACGAC GCTGTGGACC TGCCGATCCT GCGTCCGCTG
ATCGGCATGG ACAAGACCGA GATCATGGAC CAGGCCCGCG CCATCGGCAC GCTCACCATT
TCCGAGCTGC CCGACGAGGA CTGCTGCACC ATGCTCACCC CGCGGCAGGT GGAGACTGCG
GCCAAGATCG CCGATCTGCG GCAGATCGAG AAGCGACTGG ACGTTGAAGA ACTCGCCGAA
CACCTGGCCA GCAACGTGCA ACTGCACCGC CCCAGCTTCC TGGGCGAGGA GGAGAACACC
GCAGCCTGA
 
Protein sequence
MSASLDSTTV DARADGAVGT AAQELGELCV LLKLGEIVLK GSNRHLFERR LQNNIRAAAK 
GLPEFRLSQR KGIIMLRMPG ASDLEVAQLA ERMRNVMGIV WVHLVRRVAK DVDTVTQVAV
QAMADRSGSF AVRARRRDKR FPMISSELAG HVGAAIKRAY PHLTVNLSQP DHTVQIEVDK
DEVFVFTDGI PGQGGLPVGM SGRGLVLLSG GIDSPVAAYR MMRRGLRLDF LHFSGMPFTG
PESIYKAYSL VRQLDRFQNG SRLYVVPFGK AQQQLRNSGA ERLQIVAQRR LMLKTAEALA
DRLNASALVT GDALGQVSSQ TLANITALDD AVDLPILRPL IGMDKTEIMD QARAIGTLTI
SELPDEDCCT MLTPRQVETA AKIADLRQIE KRLDVEELAE HLASNVQLHR PSFLGEEENT
AA