Gene Tmz1t_3740 details


Gene Information

Locus tag: Tmz1t_3740
Symbol: cbiD
ID: 7873738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4110578
End bp: 4111744
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 73%
IMG OID: 643700685
Product: cobalt-precorrin-6A synthase
Protein accession: YP_002890709
Protein GI: 237654395
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1903] Cobalamin biosynthesis protein CbiD
TIGRFAM ID: [TIGR00312] cobalamin biosynthesis protein CbiD
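
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4110578
end_bp = 4111744

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded
# from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the script prints 1167 and 388, which agrees with the reported gene length and protein length.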


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.72099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGCAG GACATTCCCT CCCCGACAAG GTGCGCAAGG GCGACGCCAG GCGCTCGCGC 
GGCAATCGTA CCGGCTTCAC CACCGGCGCC AACTCGGCCG CGGCCGCGGC GGCCGCCACG
CTCGGCCTGG TGCGCGGCGC GGTGCCCGAC GCGGTGGAGT GCGTGCTGCC CAACACGACG
CGGGTGCGCT TCACGATCCG CGACGGCCAG GTGGACGGCG ACCACGCCCA CGCGGTCTCG
ATCAAGGACG CCGGCGACGA TCCCGATGCC ACCCATGGCG CCCGTCTCAC CGCCGACGTG
CGCCGCATCC GCGGGGGCGG CGGCGTGGTG ATCCTCGCGG GCGGCCCTGG CGTGGGCGTG
GTCACCAAGC CCGGGCTCGG GCTGGCGGTC GGCGGCCCCG CGATCAACCC CGTGCCGCGG
CGCAACATCA TCGACAACGT GCGCGCGGCC GGCACGCCCA TCCTCGAGGC CGGCGACGGT
CTGGAGGTGA CGATCTCGGT GCCCGGCGGC GAGGAGATTG CGCGGAAGAC GCTCAATGCC
CGCCTCGGCA TCCTCGGCGG CATCAGCATC CTCGGCACCA CCGGCATCGT CCGCCCGTAT
TCCACCGCCG CCTTCCGCGC CAGCGTGATC CAGGCCATCG ATGTCGCCGC CAACCAGGGC
CAGACCTGCG TGGTGTTCAC CACCGGTGGG CGCACCGAGA AATGCGCGAT GCGCGCCTTC
CCGGACCTCG ACGAGGCCTG CTTCGTGCAG ATGGGCGACT TCGTCAAGGC CGCCTTCACC
ACCGCGGTGA GGCAGGGCAT GCGCCACATC GTCGTCGGCG CCATGATCGG CAAGCTCACC
AAGATCGCCC AGGGCCTGTC GGTCACCCAC GCCTGGCGCG AGGAGGTCGA TCGCGAGCTG
ATCGCCGCCG CCGCTGCCGA GGTCGGCGCG CCGCCCGCGC TCGTGGCCGA GATCCGCGCC
GCCGAGACCG CCCGCTTCGC CGCCGAACGC CTGAGCGCGC TCGGCCTGGC CGTGGCCTTC
CACCGCGCGC TCGCCGGGCG CGCCATCCGC AGCCTGCGCC AGCGCTACCC CGGCCCGCAC
CGGCTCACCG TGCTGGCGTG CAACTTCGAG GGCGTGCCGA TCGTGAGCGT CGATGAGGCC
GACCTGAAGG AGACCACGCA TGCCTGA
 
Protein sequence
MAAGHSLPDK VRKGDARRSR GNRTGFTTGA NSAAAAAAAT LGLVRGAVPD AVECVLPNTT 
RVRFTIRDGQ VDGDHAHAVS IKDAGDDPDA THGARLTADV RRIRGGGGVV ILAGGPGVGV
VTKPGLGLAV GGPAINPVPR RNIIDNVRAA GTPILEAGDG LEVTISVPGG EEIARKTLNA
RLGILGGISI LGTTGIVRPY STAAFRASVI QAIDVAANQG QTCVVFTTGG RTEKCAMRAF
PDLDEACFVQ MGDFVKAAFT TAVRQGMRHI VVGAMIGKLT KIAQGLSVTH AWREEVDREL
IAAAAAEVGA PPALVAEIRA AETARFAAER LSALGLAVAF HRALAGRAIR SLRQRYPGPH
RLTVLACNFE GVPIVSVDEA DLKETTHA
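
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the standard amino acid assignments (which translation table 11 shares with the standard code, differing in the permitted start codons), and alternative start codons are not handled.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard
# genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
first_block = "ATGGCGGCAG GACATTCCCT CCCCGACAAG"
print(translate(first_block))  # prints MAAGHSLPDK
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content reported in the Gene Information section.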