Gene BLD_0132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBLD_0132 
SymbolthiE 
ID6363499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBifidobacterium longum DJO10A 
KingdomBacteria 
Replicon accessionNC_010816 
Strand
Start bp155460 
End bp158213 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content64% 
IMG OID642679272 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001954076 
Protein GI189438995 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase
[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.436618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATG AATACCCATA CGCCTCCATG CGCGACAGCT TCGACCTGTC CGCGTACTTC 
GTGGTGGGGC CGGAGGACTG CAAAGGCCGT CCGCTGACCG ACGTGGTGGA TCAGGCTCTG
CACGGCGGCG CAACCTTCAT CCAACTGCGC GCCAAGGAGG CCGATGCCTC TGAGCTGACC
GACATGGCTC GCGACATCGC GCAAATCATC GAGGACAATG AGAAGTCCGA TTCCGTGGCC
TTTGTGATCG ACGACCGTGC CGACGTGGTG TGGCAGGCCC GCCGCAAGGG CATCAAGGTG
GACGGTGTGC ACATCGGCCA GACCGACATG GAGCCGCGCG AAGCCCGCGC CCTGCTCGGC
GACGAGGCCA TCGTGGGCCT GTCGGCTGAG ACCGAAAGCC TCGTTCGGCT CATCAACGAG
CTGCCCGACG GCTGCATCGA CTACATTGGC GCCGGTCCTC TGCACGTCTC CACCACCAAG
CCCGAGGCTT CCGTGGGCGG CAACGACGGC TCCGGCAAGA CGCTGGACGC CGCCCAGATC
AACACCATCT GCGTTGCCAG CGAATTCCCG GTGGTCGTGG GCGGCGGCGT GACCGCAGCC
GATATGGCCA TGCTCGCCGA TACCAAGGCT GCTGGCTGGT TCGTGGTCTC CGCCATCGCG
GGTGCCGAGA ACCCGGAAGA GGCCGCGCGC ACCATGGTCG AAGGCTGGAA GGCCGTCCGC
GGCGACAAGA AGCACGGCTA CGCTCCGCGC GTCGTAACCC ACACCCCCGC TACTGACACT
CAGGCTGCTC AGGAAGGGGC CGCGAAGCCC GGTTCCGAGG CCACTGAGAA GAAGTTCACC
AACGCCAAGG ACGCCAAGGA TGCCCAGAAG CTTGCCAAGC AGCAGCGCGT GGACATCGCC
GCCCGTGGTT CCAAGCAGCG CGACAAGGCG CACATCCGCA AGACCAAGTC CGTACCGTTC
ACCTATCAGT ATGGTTCCTA CGACCTGGAA GTGCCGTACA CCGAAATCAA GCTGAGCGAC
ACGCCCGGCG TGGGCCCGAA CCCGCCGTTC CACGACTACA ACACCGAAGG CCCCAAGTGC
GACCCGAAGG AAGGCTTGAA GCCCCTGCGT CTCGACTGGA TCCGCGACCG CGGCGACATC
GAAGACTACG AGGGCCGCCG CCGCAACCTT GAGGACGATG GCAAGCGCGC CATCAAGCGC
GGCCGCGCCA CCAAGGAATG GCGTGGACGC AAGCATGAGC CGATGCGTGC CAAGGACCAC
CCGATCACCC AGATGTGGTA CGCCCGCCAC GGCATCATCA CCCCGGAGAT GCAGTACGTG
GCCACCCGCG AGAACTGCGA TGTGGAGCTC GTGCGCTCCG AGCTGGCCGC CGGCCGCGCG
GTGATGCCTT GCAACATCAA CCACCCCGAG GCCGAGCCGA TGATTATCGG ATCCGCCTTC
CTGACCAAGC TCAACGCCAA CATGGGCAAC TCGGCCGTCA CCTCCTCCAT CGATGAGGAA
GTGGAGAAGC TGACGTGGGC CACCAAGTGG GGCGCCGATA CCGTGATGGA CCTGTCCACC
GGTAACGACA TCCACACCAC CCGTGAGTGG ATTCTTCGCA ACTCCCCCGT GCCGATCGGC
ACCGTGCCGA TGTACCAGGC CCTCGAAAAG GTCGAGGATG ATGCCTCCAA GCTGAGCTGG
GAACTGTTCC GCGACACCGT GATCGAACAG TGCGAGCAAG GTGTGGATTA CATGACCATC
CACGCCGGCG TGCTGCTGCG TTACGTGCCG CTGACCGCCA ACCGCGTGAC CGGCATCGTC
TCCCGCGGTG GCTCCATCAT GGCCGACTGG TGCCTGCGAC ACCACCAGGA GAGCTTCCTG
TACACGCACT TCGACGAGCT GTGCGACATC TTCGCCAAGT ACGACGTGGC GTTCTCTCTG
GGCGATGGCC TGCGTCCTGG CTCCCTGGCC GACGCGAACG ACGCCGCGCA GCTTTCCGAG
CTCATGACTC TGGGCGAGCT GACCGAGCGC GCCTGGGCCA AGGACGTGCA GGTGATGATC
GAGGGCCCGG GCCACGTGCC GTTCGACACC GTGCGCATGA ACATCGAGCT GGAGAAGGCC
GTGTGCCACA ACGCCCCGTT CTACACCCTC GGACCGCTGA CCACCGATAC CGCTCCGGGT
TACGATCACA TCACCTCCGC TATCGGCGCC ACCGAGATCG GTCGTTACGG CACCGCTATG
CTGTGCTACG TGACCCCGAA GGAACACCTC GGCCTGCCGA ACAAGGATGA TGTGAAGCAG
GGCGTGATTG CCTACAAGAT CGCCTGCCAC GCCGCAGACA TCGCCAAGCA CCACCCGCAC
GCGATGGATC GCGACAATGC GATTTCCAAG GCTCGCTTCG AGTTCCGTTG GCTCGACCAG
TTCAATCTGT CCTACGATCC GGACACCGCC ATCGCCTTCC ACGACGATAC CCTGCCCGCC
GAGCCGGCCA AGATGGCGCA CTTCTGCTCG ATGTGCGGCC CGAAGTTCTG CTCGATGGCC
ATCTCCCAGA ACATCCGCAA GGCATTCGGC GGCGAGGCCG CTCAGCAGCA AATCGTGAAG
GAGGCTGCTG CAGGCATCGA CTCCGAGGCA CTGGCTACGG CCAAGGCCAA TGTCGATAAC
GGCGTGGTGT CCGCCAACGT GCTCAGCCCC GAAGAAATCC TCGCCGGCAT GGATGCGATG
AGCGAAAAGT ACACCGCCCA GGGCGGCAAG CTCTACTCCA CCGCCCAGGA GTAA
 
Protein sequence
MSNEYPYASM RDSFDLSAYF VVGPEDCKGR PLTDVVDQAL HGGATFIQLR AKEADASELT 
DMARDIAQII EDNEKSDSVA FVIDDRADVV WQARRKGIKV DGVHIGQTDM EPREARALLG
DEAIVGLSAE TESLVRLINE LPDGCIDYIG AGPLHVSTTK PEASVGGNDG SGKTLDAAQI
NTICVASEFP VVVGGGVTAA DMAMLADTKA AGWFVVSAIA GAENPEEAAR TMVEGWKAVR
GDKKHGYAPR VVTHTPATDT QAAQEGAAKP GSEATEKKFT NAKDAKDAQK LAKQQRVDIA
ARGSKQRDKA HIRKTKSVPF TYQYGSYDLE VPYTEIKLSD TPGVGPNPPF HDYNTEGPKC
DPKEGLKPLR LDWIRDRGDI EDYEGRRRNL EDDGKRAIKR GRATKEWRGR KHEPMRAKDH
PITQMWYARH GIITPEMQYV ATRENCDVEL VRSELAAGRA VMPCNINHPE AEPMIIGSAF
LTKLNANMGN SAVTSSIDEE VEKLTWATKW GADTVMDLST GNDIHTTREW ILRNSPVPIG
TVPMYQALEK VEDDASKLSW ELFRDTVIEQ CEQGVDYMTI HAGVLLRYVP LTANRVTGIV
SRGGSIMADW CLRHHQESFL YTHFDELCDI FAKYDVAFSL GDGLRPGSLA DANDAAQLSE
LMTLGELTER AWAKDVQVMI EGPGHVPFDT VRMNIELEKA VCHNAPFYTL GPLTTDTAPG
YDHITSAIGA TEIGRYGTAM LCYVTPKEHL GLPNKDDVKQ GVIAYKIACH AADIAKHHPH
AMDRDNAISK ARFEFRWLDQ FNLSYDPDTA IAFHDDTLPA EPAKMAHFCS MCGPKFCSMA
ISQNIRKAFG GEAAQQQIVK EAAAGIDSEA LATAKANVDN GVVSANVLSP EEILAGMDAM
SEKYTAQGGK LYSTAQE