Gene GM21_3610 details

Gene Information

Locus tag: GM21_3610
Symbol:
ID: 8138983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4190825
End bp: 4191895
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 65%
IMG OID: 644871230
Product: cobalamin biosynthesis protein CbiD
Protein accession: YP_003023389
Protein GI: 253702200
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1903] Cobalamin biosynthesis protein CbiD
TIGRFAM ID: [TIGR00312] cobalamin biosynthesis protein CbiD
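
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4190825
END_BP = 4191895


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1071 356
```

Both results agree with the listed Gene Length (1071 bp) and Protein Length (356 aa).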


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 124
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGA AAGAACTCAG ATACGGTTTC ACCACCGGGG CATGCGCCGC CGCCGCCGTC 
AAGGCGGCCG CCCAGATGCT GCGCGACCAG GCGATGGTTC GCGAGGTTGA ACTCATGCTT
CCCTGCGGCA TCGGCGCGAA CTTCCAGGTC CACGGCGGGG TGTTGCGCGA CAACACGGCT
TCGTGCTACG TCGTGAAGGA TGCCGGCGAC GACCCGGACG TGACCAACGG CGCCGAGATC
CACGTCACCG CCAGCATCGA GTTCTTCACC AAGAACGAGA TAAAGATCGA GGGGGGGACC
GGCATCGGCC GGGTCACCAA GCCGGGGCTC GCGGTCCCGG TGGGCGCGTG GGCGATAAAT
CCGGTACCGC GCAGCATGAT CCTGGAAGTG GTGAAGGAGG TATTCGCGCT GCGCTGCATT
CCGGCGACGC TCACCTTCAG CATCAGCATC CCCAACGGCG AGGAACTGGC GAAGAGGACC
CTCAACGAGC GGCTCGGCAT CGTCGGCGGG CTCTCCATCC TCGGGACCAC CGGAATCGTC
AAGCCGATCT CGGCCAAGGC CTGGACCGAC ACGGTGGACG CCTCGGTCGA CGTGGCTTTG
GCCTGCGGCG CGCGTACCGT CGTCCTTGCC ACAGGGAGGA GTTCCGAGAT CGTGGCGCAG
AAGCACCTTT CCCTGAGCGA GGAGGCCTTC GTCATGATGG GGGACCACTT CGGCTACGCG
ATGCGGAGTT GCGCCAGCAA GGGGGTTCCG GAAGTCGTTG TCGCCGGGCA GTTCGCCAAG
CTGGTGAAGA TCGCCTGCGG TCACGAGCAG ACCCACGTGA CCTCGTCCCA GATGGACCTG
GATGCTCTGG CCTGGTGGCT GAGGGAGGTG CCGGCGACGG CGCACCTGGA GCAGATGGCG
CGCGAGGCGA ACACGGCGCG ACACCTGCTT GAGGCGTCGG GGTACAACAA GGCCCTCATC
GAACTGGTCT GCTCCCGTGT GCTCAAGGTC TGCGCCGATG TGGCGCCCTG GATGAAGGCG
CGGGTGATGC TGGCGGGATA CCACGGCGAT CTTTTGTACT TTTCCCCGTA G
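
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage of the kind reported in the GC content field; the function names are hypothetical, and only the first sequence line is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGAGCGGGA AAGAACTCAG ATACGGTTTC ACCACCGGGG CATGCGCCGC CGCCGCCGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full gene sequence through the same two functions would give a value to compare with the listed GC content.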
 
Protein sequence
MSGKELRYGF TTGACAAAAV KAAAQMLRDQ AMVREVELML PCGIGANFQV HGGVLRDNTA 
SCYVVKDAGD DPDVTNGAEI HVTASIEFFT KNEIKIEGGT GIGRVTKPGL AVPVGAWAIN
PVPRSMILEV VKEVFALRCI PATLTFSISI PNGEELAKRT LNERLGIVGG LSILGTTGIV
KPISAKAWTD TVDASVDVAL ACGARTVVLA TGRSSEIVAQ KHLSLSEEAF VMMGDHFGYA
MRSCASKGVP EVVVAGQFAK LVKIACGHEQ THVTSSQMDL DALAWWLREV PATAHLEQMA
REANTARHLL EASGYNKALI ELVCSRVLKV CADVAPWMKA RVMLAGYHGD LLYFSP
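
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino-acid string, which translation table 11 shares with the standard code for codon assignments. It is a simplified illustration: it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here), and it raises `KeyError` on ambiguous bases.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic-code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGGGA AAGAACTCAG ATACGGTTTC"))  # MSGKELRYGF
```

The output matches the first 10 residues of the protein sequence listed above.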