Gene GM21_2202 details

Gene Information

Locus tag: GM21_2202
Symbol:
ID: 8137538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2572832
End bp: 2573881
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 63%
IMG OID: 644869817
Product: cytochrome bd ubiquinol oxidase subunit II
Protein accession: YP_003022012
Protein GI: 253700823
COG category: [C] Energy production and conversion
COG ID: [COG1294] Cytochrome bd-type quinol oxidase, subunit 2
TIGRFAM ID: [TIGR00203] cytochrome d oxidase, subunit II (cydB)
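
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2572832
end_bp = 2573881

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1050 bp) and protein length (349 aa).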


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0000000104234
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTGATC CGGTAATTCT CGCCGCGATG GTCCTCGTTG CCGGCCTGCT CATGTACGCG 
CTCTTCGGCG GGGCCGACTT CGGCGGCGGC ATCTGGACCG CTCTCGCTTT CGGGCCGCGC
GCGCGGGAGC AGAGGGAAGC GCTTTTCAAC GCCATCGGTC CTGTGTGGGA GACGAACCAC
GTCTGGCTTA TCTTCGTTAT CGTCACCCTC TTCACCGCCT TTCCCGCCGG TTTCGGTAAC
CTCTTCATTG TGCTGATGAC ACCTTTGGTG CTGGCCTTGG TGGGGATTAA CTTCAGGGGC
GCCGCTTTTG CCTTCCGCCA TTTCGGCAGG GAACTGAAGA AGGAGACCCC GGTCAGCGCC
CGCGTGTTCG AGATCGCCAG CGTGCTCACG CCGTTCACCC TGGGGCTTGC GGTATCCGCC
ACTGCGGCCG GGAGGATCGT CATCGCCGGG CGCATGCCAA CCGAACAGTT TTCAGACTGG
CTCAACTCCT TCACCCTTTT GGGGGGCGTG GTCGGCATGG CGATCTGCGC CTATCTGGCA
CCCATTTATA TGACGGTGCG CGTCACGGGG GAGCTGCGGG AGGACTTCCG GAAGGAGTCG
CTGGCGGCAG GGCTCGCCCT GGGCATTCTC ACCTCGCTGA TGATCCCGCT GGCGCACTAC
CAGGCGCCGC TTTTCGCCGA GAGGCTGTTC AACTCGTGGC CCATGCTGTT CGTGATGCTG
GCGATACTTG CCGGCGTCGT CACCGAGTCT TTGCTCTGGC TGCGGCGCTA TTTCTGGGCG
CAGCTTATAG CCGGCGCTAC CATCGTCTTC ACCATGGCTG GCTTTGTCGC TGCGCTCAAC
CCCGACATCC TGATCGGGCA ACTGACTCTG CGTGCCGCCG CGGCGCCGCA TCCGACCCTG
GTCGCCTTTC TCGCCGTCCT GCCGATAGGG GCTTTGATCC TGGTCCCTTC GCTCGTTTAC
CTGTACTGGA CTTTTCGGGG TGAGCCGTCT GCCGATATGC CGCCGGCTGG AAAGGCCGGG
AGGGGAGGGG GACAGGGCGA GGAATCGTGA
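
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of a GC-content calculation follows, assuming GC content is defined as the fraction of G and C bases over the full sequence length. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGGCTGATC CGGTAATTCT CGCCGCGATG GTCCTCGTTG CCGGCCTGCT CATGTACGCG"
print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))
```

Passing the full 1050 bp sequence to `gc_fraction` gives the value to compare against the GC content reported in the record.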
 
Protein sequence
MADPVILAAM VLVAGLLMYA LFGGADFGGG IWTALAFGPR AREQREALFN AIGPVWETNH 
VWLIFVIVTL FTAFPAGFGN LFIVLMTPLV LALVGINFRG AAFAFRHFGR ELKKETPVSA
RVFEIASVLT PFTLGLAVSA TAAGRIVIAG RMPTEQFSDW LNSFTLLGGV VGMAICAYLA
PIYMTVRVTG ELREDFRKES LAAGLALGIL TSLMIPLAHY QAPLFAERLF NSWPMLFVML
AILAGVVTES LLWLRRYFWA QLIAGATIVF TMAGFVAALN PDILIGQLTL RAAAAPHPTL
VAFLAVLPIG ALILVPSLVY LYWTFRGEPS ADMPPAGKAG RGGGQGEES
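
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand rather than using a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so this simplified version is only adequate because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order for each position, paired with
# the corresponding amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence (60 bases, 20 codons).
first_line = "ATGGCTGATC CGGTAATTCT CGCCGCGATG GTCCTCGTTG CCGGCCTGCT CATGTACGCG"
print(translate(first_line))  # MADPVILAAMVLVAGLLMYA
```

The printed residues match the first two blocks of the protein sequence above (MADPVILAAM VLVAGLLMYA).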