Gene GM21_3606 details


Gene Information

Locus tag: GM21_3606
Symbol:
ID: 8138979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4186496
End bp: 4187539
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 64%
IMG OID: 644871226
Product: cobalamin (vitamin B12) biosynthesis CbiG protein
Protein accession: YP_003023385
Protein GI: 253702196
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2073] Cobalamin biosynthesis protein CbiG
TIGRFAM ID:
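
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither assumption is stated in the record, but both are consistent with the values shown. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4186496
end_bp = 4187539

# Assumption: coordinates are inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1044 347
```

Both results agree with the Gene Length and Protein Length fields.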


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 129
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTAG CGATCATCGC CATAACCGGC AACGGCGCCC GCCTGGGGAA AGTGCTGCAA 
GAGGGAATCC CCGATAGCCG GCTCTTCGTG ATCGAGAAGC ATGCCGGGCC TTCCTGCCAC
CCGTTTTCGG AGCCGGTGCC GGCTTTGATC TCGCGCCTTT GGCCGGAATG CCGCGGGTTC
ATCTGCATCA TGGCCACCGG GATCGTGGTC CGCTGCATCG CGCCGCTTCT GCAGGCCAAG
GACCGGGACC CCGCGGTGGT GGTGCTCGAC GACGCAGGGA AGTTCGCGGT CTCGCTTCTG
TCCGGCCATC TGGGAGGTGC CAATGCCCTC GCGAAAAGCT GCGCGTCCCT TACCGGATGC
ACCCCCGTCG TCACCACTGC AACCGATGCC AACGACCTAC CCTCCTTCGA CCTGCTGGCG
CAGGAGAACG GCTGGGTCAT CGACGACCTG TCGCGGGTGA AGGCGCTGAA CGCTCTTCTC
CTTGAGGGAC GGGAGATCGC CGTGGCGGAC CCGACCGGTA GGGTGAGAAG GTACTGCGCG
GGGAGGGGGA ACCTTGTTTT CGTGGCCGAT GCGGAAAAGG CCGCTGCCTC GGGTGCGGCC
GGGTTGCTCC TGGTCACCAA TCGGACGCTT CCCTCCCCGC TCGATAGGCA ACGGACCCTG
GTGCTGCGTC CGGTAAACCT TCATCTAGGC ATCGGCTGCA ACAGGGGCAC TGCCATGGAA
GAGATCGAAG CGGTGGTCAT GGCGAACCTG GAACGGCTGG GTCTTTCCGT AAAGAGTGTC
AAGTGCCTGG CTACGGCAAG GGCCAAGGAG GATGAGGAAG GGCTTCTGGC ATTCGCCGCG
AGGCTGGGAG TGCCGCTTAT CTTTTTCGAC AACGAGGAGT TGAACGGCGT CGCCGTCCCC
TCTCCCCCTT CGGCGCATGC CATGGCCGCC ATCGGCGCGC GCGGCGTCGC AGAACCGGCG
GCGCTGCTGG CATCGGGGGG CGGGACGCTG ATCTTGAAGA AGGTGAAGGA CGGGAATGTC
ACCCTGTCGA TAGCGCAGGG GTAA
 
Protein sequence
MQVAIIAITG NGARLGKVLQ EGIPDSRLFV IEKHAGPSCH PFSEPVPALI SRLWPECRGF 
ICIMATGIVV RCIAPLLQAK DRDPAVVVLD DAGKFAVSLL SGHLGGANAL AKSCASLTGC
TPVVTTATDA NDLPSFDLLA QENGWVIDDL SRVKALNALL LEGREIAVAD PTGRVRRYCA
GRGNLVFVAD AEKAAASGAA GLLLVTNRTL PSPLDRQRTL VLRPVNLHLG IGCNRGTAME
EIEAVVMANL ERLGLSVKSV KCLATARAKE DEEGLLAFAA RLGVPLIFFD NEELNGVAVP
SPPSAHAMAA IGARGVAEPA ALLASGGGTL ILKKVKDGNV TLSIAQG
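
The relationship between the two sequence blocks above can be illustrated with a minimal Python sketch. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs only in its set of alternative start codons, which this sketch does not model (the gene here begins with ATG, so that does not matter). The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCAAGTAG CGATCATCGC CATAACCGGC AACGGCGCCC GCCTGGGGAA AGTGCTGCAA"
print(translate(first_row))  # MQVAIIAITGNGARLGKVLQ
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.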