Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1803 |
Symbol | |
ID | 4809787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2129597 |
End bp | 2130655 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107217 |
Product | cobalamin (vitamin B12) biosynthesis CbiM protein |
Protein accession | YP_001038217 |
Protein GI | 125974307 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0310] ABC-type Co2+ transport system, permease component |
TIGRFAM ID | [TIGR00123] cobalamin biosynthesis protein CbiM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000373388 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATGG CTGATGCTCT AATTTCACCG GCTGTGGGCG GAACCATGAT GGCTGTTACT GCCGGAGTTG CAGCCTACTC AGTCAATAAA ATAAAACAAG ACATGGACGA AAAGAAAATT CCGTTGATGG GGGTCATGGG AGCATTTGTG TTTGCCGCCC AGATGATTAA CTTTTCAATA CCGGGAACGG GCTCCAGCGG TCATCTTGGA GGGGGAATGC TTCTTGCCGT GTTGCTTGGG CCATACGCGG GATTTTTAAC CATGGCGTCG ATTTTGCTGA TTCAGGCATT GTTCTTCGGG GACGGTGGAC TGCTTGCCTA CGGCTGCAAT GTATTCAATT TGGGCTTTTT CACCTGTTTT ATATCCTATC CCTGTGTCTA TAAATGGATT ACCCGAAAGG GCATAAATTC AAAAAGAATC TTTTTGGGGT CGCTGATATC ATCCATAATC GGACTTCAAA TGGGTTCTTT CAGTGTTGTA ATCGAGACGC TTTTGTCAGG AAAAACGGAG CTGCCTTTCG GAAGCTTTGT ATTGCTGATG CAGCCTATAC ATCTTGCAAT CGGAGTAGTT GAAGGTCTTG TAACGGCGGC GGTTGTCATT TTCGTCTGGA GAACAAGACC TGAATTGTTT GAAGGAGCGG ACGAAGGAAA AGTACAGGAC GGTATATCTG TAAAAAAGCT TTTGGCTGTG TTCTCAGTTG CCGCCATAAT TGTCGGGGGT GTTCTTTCCT GGTTTGCATC ATCGGCTCCT GACGGTCTTG AATGGTCTGT TGAAAAGACA GCAGGAACAG CTGAATTGGA AGTCCAGGGC GAAATTTATG AAACATTGTC GCAGATACAG CAGAAAACCG CTTTTTTACC GGATTATGAT TTTAAAGCAA GATATAGTGA AAGCCGGGAA GCTTCAGATA ATGAAGATGC CGGGAAAGCT TGGCCTAATG CAAATTTGGG GACCAGTGTT TCAGGTATTG TCGGAGGAGT ATTGACTCTT GCGTTTACGG TATTCATTGG TTTTGCAATT AACATACTTA AAAGAATGAA GAAAAATGCA ACTGCGTAA
|
Protein sequence | MHMADALISP AVGGTMMAVT AGVAAYSVNK IKQDMDEKKI PLMGVMGAFV FAAQMINFSI PGTGSSGHLG GGMLLAVLLG PYAGFLTMAS ILLIQALFFG DGGLLAYGCN VFNLGFFTCF ISYPCVYKWI TRKGINSKRI FLGSLISSII GLQMGSFSVV IETLLSGKTE LPFGSFVLLM QPIHLAIGVV EGLVTAAVVI FVWRTRPELF EGADEGKVQD GISVKKLLAV FSVAAIIVGG VLSWFASSAP DGLEWSVEKT AGTAELEVQG EIYETLSQIQ QKTAFLPDYD FKARYSESRE ASDNEDAGKA WPNANLGTSV SGIVGGVLTL AFTVFIGFAI NILKRMKKNA TA
|
| |