Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1193 |
Symbol | |
ID | 8136518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1385563 |
End bp | 1388619 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868807 |
Product | cytochrome C family protein |
Protein accession | YP_003021012 |
Protein GI | 253699823 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain [TIGR01905] doubled CXXCH domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 106 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAG TCATAGCGGC GTTTTGGTTA ATACTGCTGG CGATCTCGAC AGCCTCTGCA ATCGAGTCGC CGCACATATC CAATATGTCC TGCTCGGGGA CCATGGGGTG TCACACCATA GCGCTCATCG ACGGCGTCTA CGTAGCCACG CTGGTAGACG CCTACGGTGT CAACAACCTC TGTATCAAGT GCCACAACCC GCTGACCATG GTGCGCGGAT TCTCGGCCAG GGACATAGCG AACCCGTTCG GGTCCACCGA CACGGGGCTC TTTCCGTCCG AGCAGTTCCA GAGCTCCCAT AACTGGGCCG GACCGGTCCA TATGCCTGCC GCAGGCGCCC AAATCTCGAC AGATCCGGCG ATAATCGCAC TTAGGCCCAC GAAGGCGGGA GTCCTCGGTT GTACCAGCTG CCATAATCAC CACTCCTTCA CCGGAGACCT GTTGCTGAGG CGCCCCGGCG ATACCCTCTG CCTCGATTGC CACCGGCAGC GCAACCAGCG CGACGTCCAA AGCGGCACCC ATCCGGTCAA CTTCAACTAC ACCAGCGCAA CCTCGAAGGT GAAGCTCACC CCTGCGAAGT TCTTCGAGGA GCCCGTCAAC GCCAATCCGG CGAACCCCAC TTCCGCCATG CTGCTGCCGG GAGGGAAACT TGTCTGCACC ACCTGCCACT CCCCTCACTA CGCCGACTCC AACGCGCGGA CCTTCGACAA CGCCTCCTCC TCCACCTTCG GGCTCCTTTC TTCCTCGCGA GGGAGGCTTT TGCGCACCGA CGTGAGGGGG GGGAGCGCCG ATGAGGTGAA CATCTGTACC AACTGCCACG CCGGGAAACG CGCCCATAAC GGCAAGGGAC AGAACATCCA GTGCGCCGAC TGCCACGCAG CCCACGTCGA CCCCCTGGAC GGCACCGCCC CCAACGTCTG GCTGGTGCGC CGCTACATGA ACCCGCAGAC CACCAACGTC AGGGTCGTCA ACCAGTACAA GGACGCAGGG AGCAACTGGG CCGGCCCCGA CGGCGTCTGC GTCGCCTGCC ACGCCATACC GGCTCCGGGA GGGAACTACC CCCCCGAACA CGCAAGCACC GACCCCAACG TCTGCCGCAG CTGCCACATT CACGACAGCG CCGACGGCTC CTTCGCCGCC GGATGCAACT CCTGCCACGG CAACCCGCCG CAGACAAACG CGGCCGGGCC GAGCGGCTAC GCCAGCAGCG GGAGTTACAA CTACGCCACC AGCGGCGTCT TCAAGGACGA GTCGCTCACC CCGCACCTGG CACACACCAG CCGGGGATTG ACCTGCGCCG CCTGCCACTC CGGCAACCAG CACGCAAGCG GCGACTTCCA GCAGGTATTC CGGACTCCTT CCGGCACGGC GACCTACTAC GGCGCCGCCC CAAGCTACGA CCCGGCAGGC CCGGGGAGCT GCCTCACCAA CTACTGCCAC AGTAACGGTG CGCCTTCCAG TCTCCCCCCG GTGTACAAGA CCGCGACCTG GGCCCTGGGC AACAACAGCA TCGTCGGGAC CCCCGAAGAG TGCTCCGACT GCCACGACGC GGCCCCCGAT ACCAACGCCC ACACGGGCCA CCTTTCGCGC GGCTACAGCT GCACGGTCTG CCACGCCGCG ACCGCAGCCT CCAACAGCAG CATCAAGGAC GCCTCGAAGC ACGTGAACGG GATCAAGGAA ATGGCCTTCA TCGGCGCCGC CCTCGGTACC GAGATCGACG TCTCCGACAC CTGCACCACG AGCTACTGCC ACTCGAACGG CAGAGGGGTC TACTCCTCCC CGAACTGGAC CATGAAGGCC ACCGGCGCGT GCGGCACCTG CCATGCCACG GCCCCCGGTC TGGGGAGCCA GCTCATCGCA AGCGGCGCCC ATTTCGCCCA CTTCAGCACC TCGTCCGCGG CCTACGGGCC GATGCTGACC ACGCAAAACT CGACCGGCTG CCAGGCCTGC CACAACGTGA GCAGCGCCAA CCACGTCAAT CAGAGCATCG ATCTCAACGG CTCGCTCGGC TACCTGGGCA ACGGTACCGG TACCTGCACC CCCTGCCATC CGACGCAGGT GAACTGGAGC ACCGGAGCCG TCACCTGCGA GAGCTGCCAC ACCGGAACCG TCTCGGTGAT AAACGAGGTC CCCGCGCCGA ACAAGAGCCT GGCGGCGACT GCCGGCCACG GCGCCCCGGC GCTAGGGAAG GGGTGCACCG CCTGCCACGA GCGCAACGCG CGGCACATAA ACGGCGGCAG CCGCCTCCAG GCGCAATTGA GCGGCAGCCT CAACGCCGAC TGCAGATACT GCCACGACAA CGCCTCGGAG GTGGTCACCG AAGGCTTCCG GAACATGAGC ACCCACTTCC TGACCAAGGG GGGGAGCCAG GCCATGGCCT GCGCTAAGTG CCACGACCCG CACGGCTCGA CCAACCTGCA CATGATCAAA ACGCTCATCA ACGGCCAGGC CATCGTGTTC AACGACGCGG TTAACGGCCT GGTGAACACC ACGACCAACC AGGGGCTTTG CCAGGTCTGC CACACCCAGA CCGCCCACTA CCGCGCCGGC GTGCCGGAGA CCTCGCATCC GACGACAAAC TGCCTCTCCT GCCACGACCA CCGGGCAGCC GGAGGCGCCT TCAAGCCAGC CGGAACCTGC GACGCCTGCC ACGGCTACCC CCCCGCACCC AAGGCGACCA TAACCCCGCA GCTCTTCGGC GTGATGGGTA GCTGGTCTTC GGCGCGCTAC GAGGATTACT CCGGAGGCGG CGGCGCCCAC CTGGTCGCCG CTCATGTTTC CCCAAATGCC AAGCCGAGCG AGGGGTGGAG CAACTGCGCC ATCTGCCACA GCGGCGGCTC CACGGGCGAC TCCGGCAACC ATAAGATGAC GATGCCGCTG AAAGGGCACA TCGAGAACGT CGACCTGGTC GTGGATAAGA GGTTCCGCTT CGCCAACAGT TTCATCGTCT ACACCGGGTC GCAAAGGGTC AGCGCTCCGG CGCAGAACGC GACCGGGAGC TGCTACAACG TGAGCTGCCA CATTACCAAG TCACGGCGCT GGAGTATCGA GAGGTAA
|
Protein sequence | MKRVIAAFWL ILLAISTASA IESPHISNMS CSGTMGCHTI ALIDGVYVAT LVDAYGVNNL CIKCHNPLTM VRGFSARDIA NPFGSTDTGL FPSEQFQSSH NWAGPVHMPA AGAQISTDPA IIALRPTKAG VLGCTSCHNH HSFTGDLLLR RPGDTLCLDC HRQRNQRDVQ SGTHPVNFNY TSATSKVKLT PAKFFEEPVN ANPANPTSAM LLPGGKLVCT TCHSPHYADS NARTFDNASS STFGLLSSSR GRLLRTDVRG GSADEVNICT NCHAGKRAHN GKGQNIQCAD CHAAHVDPLD GTAPNVWLVR RYMNPQTTNV RVVNQYKDAG SNWAGPDGVC VACHAIPAPG GNYPPEHAST DPNVCRSCHI HDSADGSFAA GCNSCHGNPP QTNAAGPSGY ASSGSYNYAT SGVFKDESLT PHLAHTSRGL TCAACHSGNQ HASGDFQQVF RTPSGTATYY GAAPSYDPAG PGSCLTNYCH SNGAPSSLPP VYKTATWALG NNSIVGTPEE CSDCHDAAPD TNAHTGHLSR GYSCTVCHAA TAASNSSIKD ASKHVNGIKE MAFIGAALGT EIDVSDTCTT SYCHSNGRGV YSSPNWTMKA TGACGTCHAT APGLGSQLIA SGAHFAHFST SSAAYGPMLT TQNSTGCQAC HNVSSANHVN QSIDLNGSLG YLGNGTGTCT PCHPTQVNWS TGAVTCESCH TGTVSVINEV PAPNKSLAAT AGHGAPALGK GCTACHERNA RHINGGSRLQ AQLSGSLNAD CRYCHDNASE VVTEGFRNMS THFLTKGGSQ AMACAKCHDP HGSTNLHMIK TLINGQAIVF NDAVNGLVNT TTNQGLCQVC HTQTAHYRAG VPETSHPTTN CLSCHDHRAA GGAFKPAGTC DACHGYPPAP KATITPQLFG VMGSWSSARY EDYSGGGGAH LVAAHVSPNA KPSEGWSNCA ICHSGGSTGD SGNHKMTMPL KGHIENVDLV VDKRFRFANS FIVYTGSQRV SAPAQNATGS CYNVSCHITK SRRWSIER
|
| |