Gene Cthe_1803 details


Gene Information

Locus tag: Cthe_1803
Symbol:
ID: 4809787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2129597
End bp: 2130655
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 45%
IMG OID: 640107217
Product: cobalamin (vitamin B12) biosynthesis CbiM protein
Protein accession: YP_001038217
Protein GI: 125974307
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0310] ABC-type Co2+ transport system, permease component
TIGRFAM ID: [TIGR00123] cobalamin biosynthesis protein CbiM
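
The start, end, gene length, and protein length fields above are mutually consistent. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions, not something this record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2129597
END_BP = 2130655


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1059 352
```

The result matches the listed gene length of 1059 bp and protein length of 352 aa.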


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000373388
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATATGG CTGATGCTCT AATTTCACCG GCTGTGGGCG GAACCATGAT GGCTGTTACT 
GCCGGAGTTG CAGCCTACTC AGTCAATAAA ATAAAACAAG ACATGGACGA AAAGAAAATT
CCGTTGATGG GGGTCATGGG AGCATTTGTG TTTGCCGCCC AGATGATTAA CTTTTCAATA
CCGGGAACGG GCTCCAGCGG TCATCTTGGA GGGGGAATGC TTCTTGCCGT GTTGCTTGGG
CCATACGCGG GATTTTTAAC CATGGCGTCG ATTTTGCTGA TTCAGGCATT GTTCTTCGGG
GACGGTGGAC TGCTTGCCTA CGGCTGCAAT GTATTCAATT TGGGCTTTTT CACCTGTTTT
ATATCCTATC CCTGTGTCTA TAAATGGATT ACCCGAAAGG GCATAAATTC AAAAAGAATC
TTTTTGGGGT CGCTGATATC ATCCATAATC GGACTTCAAA TGGGTTCTTT CAGTGTTGTA
ATCGAGACGC TTTTGTCAGG AAAAACGGAG CTGCCTTTCG GAAGCTTTGT ATTGCTGATG
CAGCCTATAC ATCTTGCAAT CGGAGTAGTT GAAGGTCTTG TAACGGCGGC GGTTGTCATT
TTCGTCTGGA GAACAAGACC TGAATTGTTT GAAGGAGCGG ACGAAGGAAA AGTACAGGAC
GGTATATCTG TAAAAAAGCT TTTGGCTGTG TTCTCAGTTG CCGCCATAAT TGTCGGGGGT
GTTCTTTCCT GGTTTGCATC ATCGGCTCCT GACGGTCTTG AATGGTCTGT TGAAAAGACA
GCAGGAACAG CTGAATTGGA AGTCCAGGGC GAAATTTATG AAACATTGTC GCAGATACAG
CAGAAAACCG CTTTTTTACC GGATTATGAT TTTAAAGCAA GATATAGTGA AAGCCGGGAA
GCTTCAGATA ATGAAGATGC CGGGAAAGCT TGGCCTAATG CAAATTTGGG GACCAGTGTT
TCAGGTATTG TCGGAGGAGT ATTGACTCTT GCGTTTACGG TATTCATTGG TTTTGCAATT
AACATACTTA AAAGAATGAA GAAAAATGCA ACTGCGTAA
 
Protein sequence
MHMADALISP AVGGTMMAVT AGVAAYSVNK IKQDMDEKKI PLMGVMGAFV FAAQMINFSI 
PGTGSSGHLG GGMLLAVLLG PYAGFLTMAS ILLIQALFFG DGGLLAYGCN VFNLGFFTCF
ISYPCVYKWI TRKGINSKRI FLGSLISSII GLQMGSFSVV IETLLSGKTE LPFGSFVLLM
QPIHLAIGVV EGLVTAAVVI FVWRTRPELF EGADEGKVQD GISVKKLLAV FSVAAIIVGG
VLSWFASSAP DGLEWSVEKT AGTAELEVQG EIYETLSQIQ QKTAFLPDYD FKARYSESRE
ASDNEDAGKA WPNANLGTSV SGIVGGVLTL AFTVFIGFAI NILKRMKKNA TA
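
The protein sequence is the translation of the gene sequence under translation table 11. As a hedged illustration, the sketch below translates the first and last codons of the gene sequence. It builds the codon map from the standard compact amino-acid string in T, C, A, G base order; table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The `translate` helper is a hypothetical name for this example, not part of any database tool.

```python
from itertools import product

# Codon-to-residue map in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGCATATGG CTGATGCTCT AATTTCACCG"))  # MHMADALISP
# Last 9 bases, ending in the TAA stop codon.
print(translate("ACTGCGTAA"))  # TA*
```

The two outputs match the start (MHMADALISP) and end (TA) of the listed protein sequence; the stop codon is not part of the 352 residues.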