Gene Teth514_0974 details

Gene Information

Locus tag: Teth514_0974
Symbol:
ID: 5875909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 993893
End bp: 995191
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 40%
IMG OID: 641541330
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001662611
Protein GI: 167039626
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
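
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (which matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(993893, 995191)
print(length)                  # 1299
print(protein_length(length))  # 432
```

Both values agree with the Gene Length and Protein Length fields of this record.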


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000893445
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAAT TGGAATACGC CCTTTCAGGC ATTGTCACAA AAGAGATGAG AATAGTTGCA 
GAGTATGAAG GAGTAGACGA AGAATTTATT TTAGAAGGAG TTAAAAAAGG AGAAATAGTA
ATACCATCAA ACATCAACCA CAAAAACCTC ATTCCCAAAG GCATAGGAAG GGGTTTATCG
ACAAAAGTAA ATGCCAATAT AGGAACCTCT GATGCATACC CTGAAATTGA AAAAGAAATT
GAAAAATTAA ATGTTGCTGT AAAAGCTGGA GCAGATGCGG TAATGGATTT AAGCACAGGG
GGCGACATTA ACCAATCTCG TAGAAAAATA CTTGAAAATT CCCCTGTCCC TGTAGGCACT
GTCCCCATGT ATCAAGCAGC TGTAGAATCT ATATCCAAAT ACGGTAGCAT TGTAGCTATG
CCTGAAGAAT TCATTTTTGA AGTTATAGAA GAACAAGCAA AAGACGGGGT TGATTTTATT
ACAGTCCACT GTGGTCTAAC ATTTGAATCA TTGAAAAAGC TTAAAGACAA CGGCCGAGTG
ATGGATATAG TAAGCCGCGG TGGCTCCTTT ACAATTGCTT GGATGCTCCA TAACGACAAA
GAAAATCCTT TGTATAAACA TTTTGATAGG CTCCTTGATA TTGCTAAAAA ATATGACATA
ACTCTAAGCT TAGGAGATGG ACTGCGTCCA GGTTGTCTCG AAGACGCTAC AGATAGCGCA
CAAATTCAAG AGCTCATCAT CCTTGGGGAA CTTGTCAAAA GGGCTCGTAA AGCAGGAGTT
CAAGTGATGG TAGAAGGACC CGGGCATGTG CCAATTGACC AAATTGAAGC AAATGTAAAA
CTTCAAAAAC AACTTTGTCA TAATGCTCCT TTTTATGTGC TTGGCCCTAT TGTGACTGAT
ATAGCTCCTG GTTATGACCA CATAACTTCA GCAATCGGAG GAGCAATTGC AGCAGCTTCT
GGTGCTGATT TCCTTTGCTA TGTTACACCC GCTGAACATC TCGGACTTCC AGACAAAGAA
GATGTCAAAG AAGGCGTTAT TGCAGCAAAA ATTGCCGCCC ATGCTGCAGA TATCGCAAAA
GGCGTAAAAG GTGCTAAAGA AAAAGATTTA ACTATGGCTA GAGCTAGAAA AGCCTTAAAC
TGGGATGAGC AAATAAAGCT TTCTATAGAC CCTGATAAAG CTTTTAAATA TCGCATCAAT
AAAAACATAT CTACAGCCAA AACTTGCAGT ATGTGCGGAA AATTCTGCGC TATGAAAATT
GTCAGTGAGT ACCTTGGAAC TTCAGCTATG ACTTGCTAA
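
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGACACAAT TGGAATACGC CCTTTCAGGC ATTGTCACAA AAGAGATGAG AATAGTTGCA"
seq = clean_sequence(first_row)
print(len(seq))                      # number of bases in the sample row
print(f"{gc_fraction(seq):.0%}")     # GC content of the sample row only
```

Passing the full sequence block instead of `first_row` gives values that can be compared with the Gene Length and GC content fields.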
 
Protein sequence
MTQLEYALSG IVTKEMRIVA EYEGVDEEFI LEGVKKGEIV IPSNINHKNL IPKGIGRGLS 
TKVNANIGTS DAYPEIEKEI EKLNVAVKAG ADAVMDLSTG GDINQSRRKI LENSPVPVGT
VPMYQAAVES ISKYGSIVAM PEEFIFEVIE EQAKDGVDFI TVHCGLTFES LKKLKDNGRV
MDIVSRGGSF TIAWMLHNDK ENPLYKHFDR LLDIAKKYDI TLSLGDGLRP GCLEDATDSA
QIQELIILGE LVKRARKAGV QVMVEGPGHV PIDQIEANVK LQKQLCHNAP FYVLGPIVTD
IAPGYDHITS AIGGAIAAAS GADFLCYVTP AEHLGLPDKE DVKEGVIAAK IAAHAADIAK
GVKGAKEKDL TMARARKALN WDEQIKLSID PDKAFKYRIN KNISTAKTCS MCGKFCAMKI
VSEYLGTSAM TC
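
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the usual TCAG ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled, which is an acceptable simplification here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACACAAT TGGAATACGC CCTTTCAGGC"))  # MTQLEYALSG
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACTTGCTAA"))  # TC
```

The two outputs match the first ten and last two residues of the protein sequence above.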