Gene GM21_2002 details


Gene Information

Locus tag: GM21_2002
Symbol:
ID: 8137336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2322626
End bp: 2323693
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 64%
IMG OID: 644869615
Product: cobalamin (vitamin B12) biosynthesis CbiM protein
Protein accession: YP_003021812
Protein GI: 253700623
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0310] ABC-type Co2+ transport system, permease component
TIGRFAM ID: [TIGR00123] cobalamin biosynthesis protein CbiM
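
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED to be
# 1-based and inclusive, which the record itself does not state.
start_bp = 2322626
end_bp = 2323693

# Inclusive span: both the start and the end position are counted.
gene_length = end_bp - start_bp + 1

# Number of complete codons, including the terminating stop codon.
codons = gene_length // 3

# The stop codon does not encode an amino acid.
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.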


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.3837300000000002e-24
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACATGG CGGACGCTTT GCTTTCCCCG GCGGTGGGAG GGACCATGTG GGCGGTCTCG 
GCAGGGACAA TCGCCCTTAG TTCCGCCCGC CTGCGCCGGG AGCAGGACGA CCGCCAGGCG
CCTCTGATGG GTGTGCTGGG AGCCTTCCTT TTCGCCGCGC AGATGATCAA CTTCTCCATA
CCCGGCACCG GTTCCAGCGG ACACTTAAGC GGCGGTTTGC TGCTGGCGGT ACTGCTCGGC
CCTTCGGCCT CCTTCCTGAC CCTCGCCTCG GTGCTGGTAG TGCAGGCTCT GTTCTTTGCC
GACGGAGGGC TCCTCGCTCT TGGCTGCAAC ATCTTCAACC TGGGCGTCAT CCCATCCTTT
CTGGTCTACC CGTTCTTGTA CCGCATCCTC TCGAAGGGGG ACAGGCACCT TGCGGAGCCG
GTCCCCGGCG GCACGCCTGG TAATTTGCGG GAAACCGCGG CCGTCATGGT AGCCACGCTG
GTGGCCATGC AGCTAGGCTC CCTGGCCGTC GTGCTGGAAA CCGGTTTTTC GGGTATCTCT
TCGCTTCCTC TGGAGCGTTT CCTGTTGTTG ATGCAGCCCA TACATCTCGC CATCGGAGCG
GTCGAGGGGG CGGTGACAGT CGCCATCCTT TCCTTCGTGC GCAAGGCGCG CCCGGAACTG
TTGCAGCGTG ATCAGGCTGG TAATGTCGGA CGCTCCCGCT TCGCGGTTTT GCTCGCCTTC
CTGGTCCTCA CCCTGGTCAC CGGCGGCCTC CTTTCCCCGT TGGCTTCCAA AAACCCCGAC
GGGCTGGAAT GGTCGCTCTC CAAAGTGGGT GGCGACGCGG TCGTTCCCGG TGCGGGAGAG
GGGATGCACG GTCTTCTCGC TCACCTGCAG GAAAAAAGCG CATGGTTCCC CGATTACGTC
GTGAAGCGTG CCGCGCCCCG TCCGCTGCCA AACGGTGCCG TCGATCTGCC TGCTGCCGCG
AGCGCCGTTC CGGGAGTAGT CGGCACCCTC CTCACCCTTG CCCTTATCTG CGTCGCCGGG
GCGCTGTTGA AGAGAGGAAA GCAAAGGGCG GATATACCCG ATGCCTGA
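
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC percentage like the one reported in the record could be recomputed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = clean_sequence("ATGCACATGG CGGACGCTTT")
print(len(sample), gc_percent(sample))
```

To check the whole gene, paste all sequence lines into one string and pass it through `clean_sequence` first. The record reports the GC content as a rounded whole percentage.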
 
Protein sequence
MHMADALLSP AVGGTMWAVS AGTIALSSAR LRREQDDRQA PLMGVLGAFL FAAQMINFSI 
PGTGSSGHLS GGLLLAVLLG PSASFLTLAS VLVVQALFFA DGGLLALGCN IFNLGVIPSF
LVYPFLYRIL SKGDRHLAEP VPGGTPGNLR ETAAVMVATL VAMQLGSLAV VLETGFSGIS
SLPLERFLLL MQPIHLAIGA VEGAVTVAIL SFVRKARPEL LQRDQAGNVG RSRFAVLLAF
LVLTLVTGGL LSPLASKNPD GLEWSLSKVG GDAVVPGAGE GMHGLLAHLQ EKSAWFPDYV
VKRAAPRPLP NGAVDLPAAA SAVPGVVGTL LTLALICVAG ALLKRGKQRA DIPDA
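
The protein sequence can be reproduced from the gene sequence by codon-wise translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so the sketch below uses the standard codon assignments. This gene starts with ATG, so alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are hypothetical, and the demonstration uses only the first and last rows of the printed gene sequence.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGCACATGG CGGACGCTTT GCTTTCCCCG"))
# Last two blocks of the gene sequence, ending in the stop codon TGA.
print(translate("GATATACCCG ATGCCTGA"))
```

The two fragments translate to the first ten and the last five residues of the protein sequence shown above.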