Gene GM21_1951 details

Gene Information

Locus tag: GM21_1951
Symbol:
ID: 8137285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2260842
End bp: 2261822
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 644869565
Product: molybdenum cofactor biosynthesis protein A
Protein accession: YP_003021762
Protein GI: 253700573
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2896] Molybdenum cofactor biosynthesis enzyme
TIGRFAM ID: [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial
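
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2260842
end_bp = 2261822

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script prints 981 0 326, which agrees with the listed gene length of 981 bp and protein length of 326 aa.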


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 132
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGA TAGACACCTA CGGCAGGCGC ATCAACTACC TCCGGCTCTC CGTCACCGAC 
CGCTGCAACC TCCGTTGCCG CTACTGCATG CCGGAAGAGG GGGTGGAGAA GCTGGACCAC
TCGCAAGTAC TCTCCTACGC CGACCTCTTG CGCATCTCGA CCGAGGCGGT CGCCGCCGGC
ATCGAAAAGA TCCGCGTCAC CGGCGGCGAG CCGCTGGTCC GCAAGGGGAT AATTTCCTTC
CTGGAGCGCC TCGGCGCTCT TCCCGGGCTC AAGGAGCTGG TGCTGACCAC GAACGGCCTG
CTCCTGAAGG AGATGGCTCA AGGGCTGCGG GAAGCGGGGG TGCAGCGCCT GAACATCAGC
CTCGATTCCC TGAAGCCCGA GATCTTCGCT AGCATCACCC GAGGCGGGGA GCTTAAGCGC
GTCTTGGACG GGCTGGAGGC CGCCGAGAAG GCCGGATTCC CGCCGCACAA GATCAACGTG
GTGGTGATGC GCGGCATAAA CGACGACGAG ATCCTCGACT TCGTGGAACT CACCATGAAG
CGCCCCTACG CGGTCCGGTT CATCGAGTAC ATGCCCACCT GCGGCGACGC CGACTGGCGC
GAACTCTGCG TGCCGGGTGC CGAGATCCGG GAGCGCATCG GCGCCCGTTA CCTCATCGAG
GAGGTGAAAA ACAGCGAGCT CTCCGGCCCC TCGAAGAACT TCAAGGTGCA GGGGTCGCAG
GGGTCCCTCG GGATCATCAC CGCCATGACC GGGCATTTTT GCAACGGCTG CAACCGGTTG
CGGGTGACCG CCTCGGGGAT CGCCAAGGGG TGCCTCTTCT CGGGCGAGGG GGTGGACCTG
CGGCCGGTCC TGGCCACAGG CGACGACGCG CTTTTGCGCC GGGAGGTGCA GCGCATCGTG
GCGGCGAAAC CGGGCCGGCA CGAGGTGAGC GACGAGGGCG CGGAAACGGT CCCCTTCGCC
ATGTCCAGGG TCGGCGGGTG A
 
Protein sequence
MALIDTYGRR INYLRLSVTD RCNLRCRYCM PEEGVEKLDH SQVLSYADLL RISTEAVAAG 
IEKIRVTGGE PLVRKGIISF LERLGALPGL KELVLTTNGL LLKEMAQGLR EAGVQRLNIS
LDSLKPEIFA SITRGGELKR VLDGLEAAEK AGFPPHKINV VVMRGINDDE ILDFVELTMK
RPYAVRFIEY MPTCGDADWR ELCVPGAEIR ERIGARYLIE EVKNSELSGP SKNFKVQGSQ
GSLGIITAMT GHFCNGCNRL RVTASGIAKG CLFSGEGVDL RPVLATGDDA LLRREVQRIV
AAKPGRHEVS DEGAETVPFA MSRVGG
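
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before any analysis. The sketch below shows one possible way to strip the spacing, compute GC content, and translate the gene with the amino-acid assignments of translation table 11. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical and not part of the source database, and alternative start codons permitted by table 11 are not handled; this gene starts with ATG, so that simplification does not affect it.

```python
# Codon table built in the conventional T, C, A, G order. The amino-acid
# assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence and the final line, copied from above.
first_groups = clean_sequence("ATGGCACTGA TAGACACCTA CGGCAGGCGC")
last_line = clean_sequence("ATGTCCAGGG TCGGCGGGTG A")
print(translate(first_groups))
print(translate(last_line))
```

The two calls print MALIDTYGRR and MSRVGG, matching the first ten and last six residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the listed GC content.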