Gene GM21_0101 details

Gene Information

Locus tag: GM21_0101
Symbol:
ID: 8135404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 122683
End bp: 123672
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 64%
IMG OID: 644867721
Product: cytochrome C oxidase mono-heme subunit/FixO
Protein accession: YP_003019945
Protein GI: 253698756
COG category: [C] Energy production and conversion
COG ID: [COG2993] Cbb3-type cytochrome oxidase, cytochrome c subunit
TIGRFAM ID: [TIGR00781] cytochrome c oxidase, cbb3-type, subunit II
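
The length fields in this record are consistent with inclusive, 1-based coordinates and a single trailing stop codon. A minimal Python sketch of that arithmetic follows. The function names are hypothetical and are not part of any database tooling, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record: Start bp 122683, End bp 123672.
length_bp = gene_length(122683, 123672)
print(length_bp, protein_length(length_bp))
```

With the coordinates listed here, the sketch gives 990 bp and 329 aa, matching the Gene Length and Protein Length fields.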


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.000967148
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGATGA CCCCTGCCGT CCTCATCGTC GGCGCGCTGA TGGTCTTCTG GGCCTCCGCC 
TTCATCATCG TGGGAATACC CTCCCTCACC ATGAAGGAAA CCCCTTCCGA GATCTGGCGC
CCGCTCTCCC CGCTGGAAAA GACCGGCCAC AGGCTCTACG TCAAAAACGG CTGCAGCTAC
TGCCATTCGC TCTTCATCAG GGTGAACGAC TGGGACATAG GCGCCGAGCG GATCGCCAAG
GCTGGCGACT ATGTCGGCGT CGAGCCCGCC ATCTTGGGCT CCGAGAGAAC CGGCCCGGAC
CTCTCGCAGG AGGGGGGGGA GCACAGCGAC GACTGGAACA TCGCCCACTT CACCAACCCC
CGCTTCACCA GCCCGATCTC GCTCATGCCC TCGTGGGATT TTCTGGAGGA AAGCGAGATA
ACGGCCTTGA CCGCCTACGT CCAGGCGCAG GGGGGAAAGC ATGCGGACCT GCGTCAGGCC
CGGCAGAGGG AGTGGAAAAA GCAGGCGGTG GCGGCCTACA GCGGCGGGTT CGACCGCAAC
ATCGAGTGGC TGCACGCCCA GGTTCCCGAG GTCTGGCGGC GCATGCCGAA CCCGTACCCG
GCAACGGAAG CCGCGCTCAC GCGGGGGAAG CGGATCTACC AGCAGCTCTG CGTCAACTGC
CACTCCCCGG TCGGCGACGG CAACGGTCCG GCGATGCCCT TTCTGGCCCC CCCTCCGCTG
AACTTCACCA CGCTGCGCCG GCACGTCGTC GAGAACAGGT ACATCGGGGG GATCTTCTAC
TACCAGATCA TGAACGGGGT TACCGGGACC GCCATGCCCT ATTTCAAGAA GCACCTGGAG
TCCGAGAAGA TCTGGGATCT GGCGAATTAC CTCGCCGTTT TCTTCGTGGG GTACACCGAC
GCCAACATCG ACCCCCGCGG CATCGACGCC TCATTCGAGG GGGCGTGGGA GAACAGGTAT
CCGCCTCCGC ATAAAGCTAC GGCGGACTAG
 
Protein sequence
MKMTPAVLIV GALMVFWASA FIIVGIPSLT MKETPSEIWR PLSPLEKTGH RLYVKNGCSY 
CHSLFIRVND WDIGAERIAK AGDYVGVEPA ILGSERTGPD LSQEGGEHSD DWNIAHFTNP
RFTSPISLMP SWDFLEESEI TALTAYVQAQ GGKHADLRQA RQREWKKQAV AAYSGGFDRN
IEWLHAQVPE VWRRMPNPYP ATEAALTRGK RIYQQLCVNC HSPVGDGNGP AMPFLAPPPL
NFTTLRRHVV ENRYIGGIFY YQIMNGVTGT AMPYFKKHLE SEKIWDLANY LAVFFVGYTD
ANIDPRGIDA SFEGAWENRY PPPHKATAD
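
The gene and protein sequences can be cross-checked by translating the gene sequence and computing its GC fraction. The sketch below is self-contained Python. `clean`, `translate`, and `gc_content` are hypothetical helper names, not functions of any existing library. The codon table holds the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code mainly in the start codons it permits), so start-codon handling is deliberately left out.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate codon by codon; unknown codons become 'X'."""
    dna = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 20 bases of the gene sequence as printed in this record.
print(translate("ATGAAGATGA CCCCTGCCGT"))
```

Passing the full gene sequence, spaces and line breaks included, to `translate` yields a string that can be compared with the protein sequence listed here, and `gc_content` on the same input can be compared against the 64% in the GC content field.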