Gene GM21_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3040 
Symbol 
ID8138386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3529928 
End bp3531331 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content63% 
IMG OID644870641 
Productcytochrome c oxidase subunit I 
Protein accessionYP_003022827 
Protein GI253701638 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3278] Cbb3-type cytochrome oxidase, subunit 1 
TIGRFAM ID[TIGR00780] cytochrome c oxidase, cbb3-type, subunit I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones163 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC AAGAAGGGTA CGCGGACGAC GTCGTCAAGG GGTTCATCAC CTGGAGCATG 
GTGTGGGGTC TCGTGGCGGT CCTGGTCGGC GTGCTGATCT CGTTCCAGAT CGCATTCCCG
CAGTTGAACC TCCCCCCCTA CCTCACCTAC GGCAGGCTGC GCCCGATCCA CACCAACGCC
GGGATCTTCG GCTGGGGGAT CGGCAGCTTC ATGGCCTTTT TCTACTACAT CACCCAGCGC
CTCACCCGTA CCGGCATCTG GAGCCCGGGG CTGGCGCGGG TCCAGCTCTG GCTCTTCAAC
CTGGCCATAG CCCTCGCCGC GGTGACGCTG GCCCTCGGCA TGAACCGCTC CAAGGAGTAC
GCCGAGTTGG AGTGGCCGGT GGCGAGCCTC GTGGTGGTGG TCTGGGTCAT CTTCGCGGTC
AACATCATCA TGACCATCGT GAAGCGCCGC GAGGAGCAGA TGTACATCTC GCTTTGGTAC
ATCCTGGCCA CCCTGGTCGG CGTCGCGGTG CTCTACCTGG TGAACAACGC CTCCATTCCC
GTGTCGCTCA CCAAGTCCTA CTCCGCCTAC GCGGGGGCCA ACGACGCCAA CGTCCAGTGG
TGGTACGGCC ACAACGCGGT CGCCATGGTG CTCACCACTC CCCCCCTGGC CATCTTCTAC
TACTTCCTCC CCAAGGCGAC CGGGGTCCCC ATCTACAGCC ACCGCATGGG CGTGATCGCC
TTCTGGAGCC TCATCTTCAT GTACCTTTGG ACCGGGGCGC ACCACCTGCT CTGGGCGCCG
GTCCCCGACT GGGTGCAGAC CCTCGCCATG GGCTTCTCGG TGATGCTGAT CGCCCCCTCG
TGGGCCGCGG TCTTCAACGG CTACTTCTCC ATGAACGGAC AGTGGCACCA GATGCGGGAG
AACTACCTGG TCAAGTTCCT CATCTTCGGC ATCACCTTCT ACGGAACCCA GACGCTGCAG
GGGCCCTCGC AGTCGATCAG GACCTTCTCC GCCTTCATCC ATTTCACCGA CTGGGTCCCG
GGGCACGTGC ACATGGGGAC GCTCGGGTGG GTCTCCCTGG TCCTCTTCGC CGCGATCTAC
TACACCGTCC CCCGCATCTA CGGCACCGAG ATCTACTCGA TCCGCCTGGC GAACATCCAT
TTCTGGCTGG TGCTCACCGG GCAGCTCATG TTCTCCATCA GCATGTGGAT CGCCGGCGTG
CAGCAGGCGG CGATGCTGAA CGCGACCAAC CCGGACGGAA GCCTCCACTA CAGCTTCATG
GAGACCATGA TCGAGATCTA TCCCTACTGG CACATAAGGG CACTGGGCGG GGTGGTGTAT
CTCGCCGGCC TCAGCGTGTT CCTCTACAAC ATCTGGAAGA CCGTCGCCGG CGCTAAGACG
CAGGGCGCGG AGCAGACGGC TTAG
 
Protein sequence
MNQQEGYADD VVKGFITWSM VWGLVAVLVG VLISFQIAFP QLNLPPYLTY GRLRPIHTNA 
GIFGWGIGSF MAFFYYITQR LTRTGIWSPG LARVQLWLFN LAIALAAVTL ALGMNRSKEY
AELEWPVASL VVVVWVIFAV NIIMTIVKRR EEQMYISLWY ILATLVGVAV LYLVNNASIP
VSLTKSYSAY AGANDANVQW WYGHNAVAMV LTTPPLAIFY YFLPKATGVP IYSHRMGVIA
FWSLIFMYLW TGAHHLLWAP VPDWVQTLAM GFSVMLIAPS WAAVFNGYFS MNGQWHQMRE
NYLVKFLIFG ITFYGTQTLQ GPSQSIRTFS AFIHFTDWVP GHVHMGTLGW VSLVLFAAIY
YTVPRIYGTE IYSIRLANIH FWLVLTGQLM FSISMWIAGV QQAAMLNATN PDGSLHYSFM
ETMIEIYPYW HIRALGGVVY LAGLSVFLYN IWKTVAGAKT QGAEQTA