Gene Cthe_3150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3150 
Symbol 
ID4809713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3722858 
End bp3723913 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content39% 
IMG OID640108583 
Productadenosylcobinamide-phosphate synthase 
Protein accessionYP_001039538 
Protein GI125975628 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1270] Cobalamin biosynthesis protein CobD/CbiB 
TIGRFAM ID[TIGR00380] cobalamin biosynthesis protein CobD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000125584 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTGT TTTTTCTTTT GGATGTGTTT GTTGCATTTT TCGCGGATTT TGTTATTGGC 
CATCCGGATT GGTTGCCTCG CCCGGAAAAG TTCATCGAAT GGATTGGAAA ATATGTTGAG
AATATAATGC GCAGAATTAT CAATATATCA TCTTCAAAAA AAGTCAAAGC ATTGGGAGAG
GATATTGTTC GCAGTACCAG TAAAACATAC AGAAACGAAA GAATTGCCGG AGTGGCTTTT
GTTCTGGTAA TGACGACTCT TGTAGCTTTA ATTGTTGCTG CTTTGCTGGA ACTGTCAATG
TTTATAGACC CTATACTGTT TCATGCTGTA AACACCTGTT TGATATATTT GTCTTTCGGG
TCGAGAGCTG TTGCAAAAGA AAGTTACAAG GTATTTGACG CGTTAAAAGA AAGGGATGTG
TTCAAAGCCA GAAATATGCT CGCTGCCGTA ATAGGGATAA AAACTGAAAA TCTTGACGAA
AAGGAAATTA TAAAAAGGAC GGTGGAGTCA ACGGCTGAAG ATACAGCAGA CAGAGTAATA
TCCCCGATTT TTTATGCATC TTTGGCTTCA TTTTTTAGTT TAGGTGCTAC AATAGTTTGG
ATTTATAAAA CCATAAATAT TTTAGACCGA ATGGTGGGCT ATAAAAATGA CGAATACAGG
CATTTCGGCT GGGCAACGGC AAAGCTTGAC GACATTGTGA ATTTCATACC TGCAAGGCTG
ACAGGAATAT TGATAGTTGC AGGTGCTTTT TTAACCGGAA AAGAATACAA AAACAGTTAT
TCTATTATGA TGAGGGACAG GAAAAAACAT GCAAGCCCGA ATTCAGGCTA TCCCGAGGCC
GCTGTGGCGG GAGCGTTGGG AATAAGGCTT GGAACGGAAG TGTTGCGTTT GGGCGATATT
GTTGAAAAGC CCGCAATTGG TGATGATATA AATGAACTGG ATATTAAAGC CATTTCTCAG
ACGGTCAGCC TGATGTACGC TGCATCGTTT ATTGCCTTGT TGCTGATGGA AGCTTTAGGA
CTTTTGATAT TTGTGTTTTA TAACTATGTA TATTAA
 
Protein sequence
MSLFFLLDVF VAFFADFVIG HPDWLPRPEK FIEWIGKYVE NIMRRIINIS SSKKVKALGE 
DIVRSTSKTY RNERIAGVAF VLVMTTLVAL IVAALLELSM FIDPILFHAV NTCLIYLSFG
SRAVAKESYK VFDALKERDV FKARNMLAAV IGIKTENLDE KEIIKRTVES TAEDTADRVI
SPIFYASLAS FFSLGATIVW IYKTINILDR MVGYKNDEYR HFGWATAKLD DIVNFIPARL
TGILIVAGAF LTGKEYKNSY SIMMRDRKKH ASPNSGYPEA AVAGALGIRL GTEVLRLGDI
VEKPAIGDDI NELDIKAISQ TVSLMYAASF IALLLMEALG LLIFVFYNYV Y