Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0042 |
Symbol | |
ID | 8135341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 54227 |
End bp | 55855 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644867659 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_003019887 |
Protein GI | 253698698 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.00140919 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCCGG CGGAAAACAT CACGACATCG GCCCTTGGCG GGTTCTGGAG CGACACGGGC AAAACCGGCA TCCGCTCCTG GATATTCTCG ACCGACCACA AGCGGATCGG GCTGCTTTAC TTCTACTCGG TCTTCGGATT CTTCCTGGTC GGGGCCTTGC TGGGCCTTTT GATCCGGCTG GAACTGATTG CGCCGGGAGA GACGATCGTC CATGCCGCGA CCTACAATGC CCTCTTCACG GTGCACGGCG TGGTGATGAT CTTTCTCTTC ATCATCCCCG GGATTCCGGC GTCGTTTGGC AACCTGGTGC TGCCGATACA AATAGGCGCC CGCGACGTGG CCTTTCCCCG CTTGAACCTC TTTTCCTGGT GGCTCTACAC GACCGGGGCG GTGGTCGTAC TGCTTTCGCT TTTTACCGGC GGCGGCCCGC CCGACACGGG GTGGACTTTT TACGTCCCCT TCAGCGTCCG GACCGGCACC AACGTTTCGC TTGCAGTACT TGGGGTCTTT ATCCTCGGCT TCTCCTCCAT CCTTACCGGG ATCAACTTCG TCACCACCAT CCACCGGATG AGGGCGCAGG GGATGACCTG GACCAGGATA CCGCTGTTCA CCTGGTCTCT CTACGCGACC GCCTGGGTGC AGATCCTCGC CACGCCCATA ATCGCCATCA CGCTGGTGCT GGTCGCAGCG GAGCGGATAC TGGGACTTGG CTTGTTCGAG CCGAGCCGCG GCGGCGACCC GATCATGTTC CAGCACCTGT TCTGGATCTA TTCACATCCT GCCGTCTACA TCATGATCCT CCCGGGGATG GGGGTGATCT CAGACGTGAT CCCCGTTTTC GCCAGGAAGC CGATCTTCGG GTACAAGATG ATCGCCTTCT CAAGCATCGC CATAGCGGCG GCGGGCTCGG CGGTCTGGGG GCACCACATG TACACAAGCG GCATGAGCGA CATGGCGGTG CTGCTCTTCT CCTTTCTCAC CTTCCTGGTC GCCATACCCT CGGCCATCAA GGTCTTCAAC TGGATCTCGA CGCTGTACAA GGGGTCGATC TCCCTGGAGG CGCCGATGCT GTTCGCGCTC TCCTTCATCC TGCTCTTCTC CATCGGGGGG CTGAGCGGTC TGATCCTCGG CGCTGCGGCT ACCGACATCC ACGTACATGA CACCCATTTC GTGGTCGGGC ACTTCCATTT CGTGATGTTC GGCGGTACCG GTTTCGCCTT TTTCGCCGCG GCCCATTACT GGCTGCCGAA ATTCTACGGG CGCAGGTATC AGGAGAAGCC TGCGATCATC GGATGGCTGC TGATGTTCTC GGGCTTCATC GTCCTTTACC TGAGCATGCA GACGGTCGGC ATGCAGGGGA TGCCCCGCCG CTACTACGAC TACCTGCCGG AGTTCACCCA GCTCAACGTG GTGGCCACCG TCTCAAGCTG GGTGATGATG GCGGGGGTGT TCATTGTGGT CTGGAACCTT TTCCGCGGAC TGTTCCGGGG CGAGCCGTTC ACCGGGAACC CATGGGGAGG CGCCTCGCTG GAGTGGAGCG TTCCCACCCC GCCGCCGACG GAAAATTTCC ATGAGGAGCC GGTGGTGACG CACGGTCCGT ATGATTTTAA GGAGGCAGGG GTCTTATGA
|
Protein sequence | MSPAENITTS ALGGFWSDTG KTGIRSWIFS TDHKRIGLLY FYSVFGFFLV GALLGLLIRL ELIAPGETIV HAATYNALFT VHGVVMIFLF IIPGIPASFG NLVLPIQIGA RDVAFPRLNL FSWWLYTTGA VVVLLSLFTG GGPPDTGWTF YVPFSVRTGT NVSLAVLGVF ILGFSSILTG INFVTTIHRM RAQGMTWTRI PLFTWSLYAT AWVQILATPI IAITLVLVAA ERILGLGLFE PSRGGDPIMF QHLFWIYSHP AVYIMILPGM GVISDVIPVF ARKPIFGYKM IAFSSIAIAA AGSAVWGHHM YTSGMSDMAV LLFSFLTFLV AIPSAIKVFN WISTLYKGSI SLEAPMLFAL SFILLFSIGG LSGLILGAAA TDIHVHDTHF VVGHFHFVMF GGTGFAFFAA AHYWLPKFYG RRYQEKPAII GWLLMFSGFI VLYLSMQTVG MQGMPRRYYD YLPEFTQLNV VATVSSWVMM AGVFIVVWNL FRGLFRGEPF TGNPWGGASL EWSVPTPPPT ENFHEEPVVT HGPYDFKEAG VL
|
| |