Gene GM21_0042 details


Gene Information

Locus tag: GM21_0042
Symbol:
ID: 8135341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 54227
End bp: 55855
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 61%
IMG OID: 644867659
Product: cytochrome c oxidase, subunit I
Protein accession: YP_003019887
Protein GI: 253698698
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID: [TIGR02891] cytochrome c oxidase, subunit I
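
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(54227, 55855)
print(length, protein_length_aa(length))  # 1629 542
```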


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.00140919
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTCCGG CGGAAAACAT CACGACATCG GCCCTTGGCG GGTTCTGGAG CGACACGGGC 
AAAACCGGCA TCCGCTCCTG GATATTCTCG ACCGACCACA AGCGGATCGG GCTGCTTTAC
TTCTACTCGG TCTTCGGATT CTTCCTGGTC GGGGCCTTGC TGGGCCTTTT GATCCGGCTG
GAACTGATTG CGCCGGGAGA GACGATCGTC CATGCCGCGA CCTACAATGC CCTCTTCACG
GTGCACGGCG TGGTGATGAT CTTTCTCTTC ATCATCCCCG GGATTCCGGC GTCGTTTGGC
AACCTGGTGC TGCCGATACA AATAGGCGCC CGCGACGTGG CCTTTCCCCG CTTGAACCTC
TTTTCCTGGT GGCTCTACAC GACCGGGGCG GTGGTCGTAC TGCTTTCGCT TTTTACCGGC
GGCGGCCCGC CCGACACGGG GTGGACTTTT TACGTCCCCT TCAGCGTCCG GACCGGCACC
AACGTTTCGC TTGCAGTACT TGGGGTCTTT ATCCTCGGCT TCTCCTCCAT CCTTACCGGG
ATCAACTTCG TCACCACCAT CCACCGGATG AGGGCGCAGG GGATGACCTG GACCAGGATA
CCGCTGTTCA CCTGGTCTCT CTACGCGACC GCCTGGGTGC AGATCCTCGC CACGCCCATA
ATCGCCATCA CGCTGGTGCT GGTCGCAGCG GAGCGGATAC TGGGACTTGG CTTGTTCGAG
CCGAGCCGCG GCGGCGACCC GATCATGTTC CAGCACCTGT TCTGGATCTA TTCACATCCT
GCCGTCTACA TCATGATCCT CCCGGGGATG GGGGTGATCT CAGACGTGAT CCCCGTTTTC
GCCAGGAAGC CGATCTTCGG GTACAAGATG ATCGCCTTCT CAAGCATCGC CATAGCGGCG
GCGGGCTCGG CGGTCTGGGG GCACCACATG TACACAAGCG GCATGAGCGA CATGGCGGTG
CTGCTCTTCT CCTTTCTCAC CTTCCTGGTC GCCATACCCT CGGCCATCAA GGTCTTCAAC
TGGATCTCGA CGCTGTACAA GGGGTCGATC TCCCTGGAGG CGCCGATGCT GTTCGCGCTC
TCCTTCATCC TGCTCTTCTC CATCGGGGGG CTGAGCGGTC TGATCCTCGG CGCTGCGGCT
ACCGACATCC ACGTACATGA CACCCATTTC GTGGTCGGGC ACTTCCATTT CGTGATGTTC
GGCGGTACCG GTTTCGCCTT TTTCGCCGCG GCCCATTACT GGCTGCCGAA ATTCTACGGG
CGCAGGTATC AGGAGAAGCC TGCGATCATC GGATGGCTGC TGATGTTCTC GGGCTTCATC
GTCCTTTACC TGAGCATGCA GACGGTCGGC ATGCAGGGGA TGCCCCGCCG CTACTACGAC
TACCTGCCGG AGTTCACCCA GCTCAACGTG GTGGCCACCG TCTCAAGCTG GGTGATGATG
GCGGGGGTGT TCATTGTGGT CTGGAACCTT TTCCGCGGAC TGTTCCGGGG CGAGCCGTTC
ACCGGGAACC CATGGGGAGG CGCCTCGCTG GAGTGGAGCG TTCCCACCCC GCCGCCGACG
GAAAATTTCC ATGAGGAGCC GGTGGTGACG CACGGTCCGT ATGATTTTAA GGAGGCAGGG
GTCTTATGA
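
The GC content reported for this gene can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted in the blocked layout shown (groups of ten bases separated by spaces and line breaks); `gc_percent` is an illustrative helper name, not a function from any library.

```python
def gc_percent(blocked_seq: str) -> float:
    """Percent G+C in a nucleotide sequence; whitespace is ignored."""
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, used here only as a small demonstration.
first_row = "ATGAGTCCGG CGGAAAACAT CACGACATCG GCCCTTGGCG GGTTCTGGAG CGACACGGGC"
print(f"{gc_percent(first_row):.1f}% GC in the first row")
```

Passing the full 1629 bp sequence instead of `first_row` gives the value to compare against the GC content field.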
 
Protein sequence
MSPAENITTS ALGGFWSDTG KTGIRSWIFS TDHKRIGLLY FYSVFGFFLV GALLGLLIRL 
ELIAPGETIV HAATYNALFT VHGVVMIFLF IIPGIPASFG NLVLPIQIGA RDVAFPRLNL
FSWWLYTTGA VVVLLSLFTG GGPPDTGWTF YVPFSVRTGT NVSLAVLGVF ILGFSSILTG
INFVTTIHRM RAQGMTWTRI PLFTWSLYAT AWVQILATPI IAITLVLVAA ERILGLGLFE
PSRGGDPIMF QHLFWIYSHP AVYIMILPGM GVISDVIPVF ARKPIFGYKM IAFSSIAIAA
AGSAVWGHHM YTSGMSDMAV LLFSFLTFLV AIPSAIKVFN WISTLYKGSI SLEAPMLFAL
SFILLFSIGG LSGLILGAAA TDIHVHDTHF VVGHFHFVMF GGTGFAFFAA AHYWLPKFYG
RRYQEKPAII GWLLMFSGFI VLYLSMQTVG MQGMPRRYYD YLPEFTQLNV VATVSSWVMM
AGVFIVVWNL FRGLFRGEPF TGNPWGGASL EWSVPTPPPT ENFHEEPVVT HGPYDFKEAG
VL
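
The protein sequence above can be checked against the gene sequence by translating it in frame 1. The sketch below is a simplified illustration, not the database's own pipeline: it builds the codon table from the compact 64-letter string in TCAG order, relies on the fact that translation table 11 assigns the same residues to codons as the standard code, and does not handle alternative start codons or ambiguous bases (an `N` would raise `KeyError`). `translate` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGTCCGG CGGAAAACAT CACGACATCG"))  # MSPAENITTS
print(translate("GTCTTATGA"))  # VL
```

The two outputs match the first ten and last two residues of the protein sequence listed in this record.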