Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BLD_0132 |
Symbol | thiE |
ID | 6363499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bifidobacterium longum DJO10A |
Kingdom | Bacteria |
Replicon accession | NC_010816 |
Strand | - |
Start bp | 155460 |
End bp | 158213 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642679272 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001954076 |
Protein GI | 189438995 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.436618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG AATACCCATA CGCCTCCATG CGCGACAGCT TCGACCTGTC CGCGTACTTC GTGGTGGGGC CGGAGGACTG CAAAGGCCGT CCGCTGACCG ACGTGGTGGA TCAGGCTCTG CACGGCGGCG CAACCTTCAT CCAACTGCGC GCCAAGGAGG CCGATGCCTC TGAGCTGACC GACATGGCTC GCGACATCGC GCAAATCATC GAGGACAATG AGAAGTCCGA TTCCGTGGCC TTTGTGATCG ACGACCGTGC CGACGTGGTG TGGCAGGCCC GCCGCAAGGG CATCAAGGTG GACGGTGTGC ACATCGGCCA GACCGACATG GAGCCGCGCG AAGCCCGCGC CCTGCTCGGC GACGAGGCCA TCGTGGGCCT GTCGGCTGAG ACCGAAAGCC TCGTTCGGCT CATCAACGAG CTGCCCGACG GCTGCATCGA CTACATTGGC GCCGGTCCTC TGCACGTCTC CACCACCAAG CCCGAGGCTT CCGTGGGCGG CAACGACGGC TCCGGCAAGA CGCTGGACGC CGCCCAGATC AACACCATCT GCGTTGCCAG CGAATTCCCG GTGGTCGTGG GCGGCGGCGT GACCGCAGCC GATATGGCCA TGCTCGCCGA TACCAAGGCT GCTGGCTGGT TCGTGGTCTC CGCCATCGCG GGTGCCGAGA ACCCGGAAGA GGCCGCGCGC ACCATGGTCG AAGGCTGGAA GGCCGTCCGC GGCGACAAGA AGCACGGCTA CGCTCCGCGC GTCGTAACCC ACACCCCCGC TACTGACACT CAGGCTGCTC AGGAAGGGGC CGCGAAGCCC GGTTCCGAGG CCACTGAGAA GAAGTTCACC AACGCCAAGG ACGCCAAGGA TGCCCAGAAG CTTGCCAAGC AGCAGCGCGT GGACATCGCC GCCCGTGGTT CCAAGCAGCG CGACAAGGCG CACATCCGCA AGACCAAGTC CGTACCGTTC ACCTATCAGT ATGGTTCCTA CGACCTGGAA GTGCCGTACA CCGAAATCAA GCTGAGCGAC ACGCCCGGCG TGGGCCCGAA CCCGCCGTTC CACGACTACA ACACCGAAGG CCCCAAGTGC GACCCGAAGG AAGGCTTGAA GCCCCTGCGT CTCGACTGGA TCCGCGACCG CGGCGACATC GAAGACTACG AGGGCCGCCG CCGCAACCTT GAGGACGATG GCAAGCGCGC CATCAAGCGC GGCCGCGCCA CCAAGGAATG GCGTGGACGC AAGCATGAGC CGATGCGTGC CAAGGACCAC CCGATCACCC AGATGTGGTA CGCCCGCCAC GGCATCATCA CCCCGGAGAT GCAGTACGTG GCCACCCGCG AGAACTGCGA TGTGGAGCTC GTGCGCTCCG AGCTGGCCGC CGGCCGCGCG GTGATGCCTT GCAACATCAA CCACCCCGAG GCCGAGCCGA TGATTATCGG ATCCGCCTTC CTGACCAAGC TCAACGCCAA CATGGGCAAC TCGGCCGTCA CCTCCTCCAT CGATGAGGAA GTGGAGAAGC TGACGTGGGC CACCAAGTGG GGCGCCGATA CCGTGATGGA CCTGTCCACC GGTAACGACA TCCACACCAC CCGTGAGTGG ATTCTTCGCA ACTCCCCCGT GCCGATCGGC ACCGTGCCGA TGTACCAGGC CCTCGAAAAG GTCGAGGATG ATGCCTCCAA GCTGAGCTGG GAACTGTTCC GCGACACCGT GATCGAACAG TGCGAGCAAG GTGTGGATTA CATGACCATC CACGCCGGCG TGCTGCTGCG TTACGTGCCG CTGACCGCCA ACCGCGTGAC CGGCATCGTC TCCCGCGGTG GCTCCATCAT GGCCGACTGG TGCCTGCGAC ACCACCAGGA GAGCTTCCTG TACACGCACT TCGACGAGCT GTGCGACATC TTCGCCAAGT ACGACGTGGC GTTCTCTCTG GGCGATGGCC TGCGTCCTGG CTCCCTGGCC GACGCGAACG ACGCCGCGCA GCTTTCCGAG CTCATGACTC TGGGCGAGCT GACCGAGCGC GCCTGGGCCA AGGACGTGCA GGTGATGATC GAGGGCCCGG GCCACGTGCC GTTCGACACC GTGCGCATGA ACATCGAGCT GGAGAAGGCC GTGTGCCACA ACGCCCCGTT CTACACCCTC GGACCGCTGA CCACCGATAC CGCTCCGGGT TACGATCACA TCACCTCCGC TATCGGCGCC ACCGAGATCG GTCGTTACGG CACCGCTATG CTGTGCTACG TGACCCCGAA GGAACACCTC GGCCTGCCGA ACAAGGATGA TGTGAAGCAG GGCGTGATTG CCTACAAGAT CGCCTGCCAC GCCGCAGACA TCGCCAAGCA CCACCCGCAC GCGATGGATC GCGACAATGC GATTTCCAAG GCTCGCTTCG AGTTCCGTTG GCTCGACCAG TTCAATCTGT CCTACGATCC GGACACCGCC ATCGCCTTCC ACGACGATAC CCTGCCCGCC GAGCCGGCCA AGATGGCGCA CTTCTGCTCG ATGTGCGGCC CGAAGTTCTG CTCGATGGCC ATCTCCCAGA ACATCCGCAA GGCATTCGGC GGCGAGGCCG CTCAGCAGCA AATCGTGAAG GAGGCTGCTG CAGGCATCGA CTCCGAGGCA CTGGCTACGG CCAAGGCCAA TGTCGATAAC GGCGTGGTGT CCGCCAACGT GCTCAGCCCC GAAGAAATCC TCGCCGGCAT GGATGCGATG AGCGAAAAGT ACACCGCCCA GGGCGGCAAG CTCTACTCCA CCGCCCAGGA GTAA
|
Protein sequence | MSNEYPYASM RDSFDLSAYF VVGPEDCKGR PLTDVVDQAL HGGATFIQLR AKEADASELT DMARDIAQII EDNEKSDSVA FVIDDRADVV WQARRKGIKV DGVHIGQTDM EPREARALLG DEAIVGLSAE TESLVRLINE LPDGCIDYIG AGPLHVSTTK PEASVGGNDG SGKTLDAAQI NTICVASEFP VVVGGGVTAA DMAMLADTKA AGWFVVSAIA GAENPEEAAR TMVEGWKAVR GDKKHGYAPR VVTHTPATDT QAAQEGAAKP GSEATEKKFT NAKDAKDAQK LAKQQRVDIA ARGSKQRDKA HIRKTKSVPF TYQYGSYDLE VPYTEIKLSD TPGVGPNPPF HDYNTEGPKC DPKEGLKPLR LDWIRDRGDI EDYEGRRRNL EDDGKRAIKR GRATKEWRGR KHEPMRAKDH PITQMWYARH GIITPEMQYV ATRENCDVEL VRSELAAGRA VMPCNINHPE AEPMIIGSAF LTKLNANMGN SAVTSSIDEE VEKLTWATKW GADTVMDLST GNDIHTTREW ILRNSPVPIG TVPMYQALEK VEDDASKLSW ELFRDTVIEQ CEQGVDYMTI HAGVLLRYVP LTANRVTGIV SRGGSIMADW CLRHHQESFL YTHFDELCDI FAKYDVAFSL GDGLRPGSLA DANDAAQLSE LMTLGELTER AWAKDVQVMI EGPGHVPFDT VRMNIELEKA VCHNAPFYTL GPLTTDTAPG YDHITSAIGA TEIGRYGTAM LCYVTPKEHL GLPNKDDVKQ GVIAYKIACH AADIAKHHPH AMDRDNAISK ARFEFRWLDQ FNLSYDPDTA IAFHDDTLPA EPAKMAHFCS MCGPKFCSMA ISQNIRKAFG GEAAQQQIVK EAAAGIDSEA LATAKANVDN GVVSANVLSP EEILAGMDAM SEKYTAQGGK LYSTAQE
|
| |