Gene Information |
Locus tag | GM21_1198 |
Symbol | |
ID | 8136523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1399323 |
End bp | 1402754 |
Gene Length | 3432 bp |
Protein Length | 1143 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868812 |
Product | cytochrome C family protein |
Protein accession | YP_003021017 |
Protein GI | 253699828 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01904] Geobacter sulfurreducens CxxxxCH...CXXCH domain; [TIGR01905] doubled CXXCH domain |
|
|
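The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon codes for one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1399323, 1402754)
print(length)                  # 3432
print(protein_length(length))  # 1143
```

Both results match the Gene Length (3432 bp) and Protein Length (1143 aa) fields reported in the record.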
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 169 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGCT GCCGCAGTCT GAAATGCTGC AACCTGATCT TGAGTTTGAC GATGCTGTTC CTTGTTATGG GCATTCTCTC CGTCTCCGGT CAACCGGCGC ATGCTTCAAC GCGGCAGTAC GTCATGACCT GCATCTCCTG CCATAAAATG CCTCCCCTCG ACTCAGCCGA CGGGACCAGG ATTCCGTACA CGGGCGCCTT AAAAGGGAGC CATTTGGGGC ATGCGTCGGC CTCAACCTCC TCCTGCGCTA AATGCCATCG CGACGATGTC GCCAACTACC GTACCGCCCA TCGAAACCGT TTGATCGAAA TTTCCCCCGC AATCAACAGC GTGGCCGGCG CCGCCTACAG TCGCGGCTTC TTCAACCAGA CCTCGGTTCC ACCGGCCATC CTCGGGACCT GCTCCAGCGT CGACTGCCAC TTCGAGTCGA CTACCCCTTC TTGGGGAACT ACGGTGCTCA AAGCTCCCGA AGACTGCTCG GCCTGCCACG GCTCCGCGCC TGCCGACGGG AACCACCCGG GGTCCGGGCA AAAGCACGGC GTCTACTACG ACACCGGCAC CGGCTCCTGC GCCGTATGCC ATCCCGATCA TCTTGGGGAC GCGAAGCCCT TCGCCCACGC CACCAGCGCA GGGAGCCGGG CCCTCGCGGT TCAGTTCACC ACCCTCCCGA ACAAGGGGGG GAGCTACTCG CGAGACCTGA GCTATCCCAA TTACCTCCCC AGTAAATCCG GGTTGCGCAA CGGCAGCTGC CTTGGGCTGT ACTGCCACAG TCCGGGGAAC AAGAACAGCA GCTTCGATCC CCCGAATCAG GCCGCAACCT GGGGGGGCAC CCTCAACTGC GCCGGCTGCC ACAAGGCGGG CCTCGCCTCC GGCAGTGTCA TGACCAGCGG CAGTCACGGC AAGCACGTCG ACGGGTCGGT CTCTTCCTTT GCCTGTTCCA AATGCCACTT TGCCACCGCC ACTAGTTCGA TGACCATCGC CGACGTGACG CAGCACGTAA ACGGACGCGT CGACATCTTC TTCGGCGTAA GCACGAGCGC TGCCAACGGC TCGTACAACG GTCTCATCTC GCCGGTATCT AAACTCCCCG GCAGCGGATA CGGCGCCTGC TCCAACGTCT ACTGCCACTC CAACGGACAA AGCGAAGGCG GCGTAGGCAT CGCCTACCGC ACACCAATCT GGGGCAGCGG CACCACCGGC AAATGCGGCT CCTGCCACGC AGACGGCAGC GGCCATAACG ACGCCGTTCC CGCCATGTCC AGCGGCAGCC ATAAAAAGCA CTTGTCCTAC ACCCTGCTTG CCACCAGCGG CCCGGTCCGT TGCACCATCT GCCACAACGT TAAAGGCGCG AAATTCACCG CGTATGCGTC GTGCAGCCAG ATGAGCTGTC ACTCCACCGG CGGGGCAATC AAGCACTCCG ACCAGGAGAT CGACGTGAGC CTGGTTAGTT ACTTCGGCGG GGTCTACGAC GGCACACGGG CTCCAGGCGA CGGCTACGGC GCCTGCGCCA ACGTCTACTG CCACAGTAAC GGGCAGGCCA CCCCGAGCTA CGCCCCTCCC GTCACCTGGG GCGCCGTGAC CTTGCGCTGC GATGCGTGCC ACGGCTCCGC CACCAGCAAG GGGGGAAGCG ACACCACGAC GTCTCTTTCC GGCAAGCACG CCGCGCACGT GAACAACGCT TTGGTGCTCG GCGCCGGCAA AAGCCTGCAC TGCATCGACT GCCACAGCAT CACCGTCAGC AGCGACACGA CCATCGCCTC GACCGCGGTC CACGTCAACA AGATGCTCAA CTACACGGGG 
CATTATGCCG GCGGTCCCAG GCGCTACAGC AGCACCACGA AAGTCTGCTC CAACATCTAC TGCCACAGCT CCGGCCAGGC AAAGCCGGTG TTCCGTAACA TGACCGGCAC CAAGTCCTGG GCCTCGACCG GCACTCTTTC CTGCAACGGT TGCCACGGCT ACGGACCGGG CACCTTCGCC TCGGTGGCGG GCGAGCCGAA CTACCTGAAC GGCGGCGCCG GTTCGGGCAC CGCAAACTCG CACCAGAAAC ATATGGCGGG GGCGAATCTG CTGGATTCGC GCGGTTGTGC CAAGTGCCAC CGCAGTACCG CGGACCAGGG GATGGCCGGG AAGCTGCGCG ACTACAGTTC GGCGCACCTC AACGGCTCCC GCGACGTGAG CTTCGCCGTG CTCGGCAATA TCTCAGGTCA CTACAGTGCA GCCGCCAAGA CCTGCTCCAA TACCTACTGC CACGGGGGGG GCTCCATCCA GTGGGGGGGG CAAGGGCCGC TCGCCTGCAA CAGTTGCCAC GGTGACGCCG AGACGCTCGG CACCAACGCC CACGCCCGCC ACATAAGCCC GAGCTCCGGC AAGGCGATCT CCTGCGCCAT CTGTCATGCC GCGACCGCAG CCGGCAACGG CTCGATCGCA GACGGCACCA TTCATGCCGA TGGCAAAAAG GACGTCGTCT TCTCCGGCGC GGCCCTAGGG ACACAGATGG ACCTCACGGG CAACTGCTCG ACGAGTTACT GCCACAGTAA CGGCAAGGGA AGTTACTCCA CCCCTAACTG GTCCGCGAAC TCTTCCGGCG CTTGCGGTAC CTGCCATGCG ACGGCGCCCG GGCTCGGCAG CCCCCTCATC GCAAGCGGCG CCCATTTCAG CCACTTCAGC ACAGCCGCTA CCAGCTATGG TCCAATGTTC AGCACGGGTA ACGTGACCGG CTGCCAGGCT TGTCACGACT TCGGCAACGA GTTGGCTTCC ACTCACATCG ACCAGACGGT GAATGTGAAC AGCTCGCTCG GGTATTCCAC TAGCGGCACC TGCACTCCCT GCCACACCAA GGAAGTTAGC TGGACCGGGG GAGCCGTCTC CTGCGAGAGC TGCCACGCCG GCACACTCTC CGTGATAAAT GGCGTCACCG CCAGCGACAA GAGTCAGGCT GCCACGCGCG GCCACGGCGG CCCGACGATC GGGAAGGGAT GTACCGACTG CCACGAACGC AACGCGCGGC ACATAAACGG CGGCTCCCGC CTCCGGGCGC AATTTTCCGG CGGGCTGAAC CTTGAGTGCA ACTACTGCCA TGACGACTCC TCAGTTTTGC TGGACCCGGA CTCCCGGAAC ATGAGTACCC ATGTTCTGGT CAAGGGGGGG ACCCCGGCGA TGGAGTGCGC CCAATGCCAT GACCCGCACG GCTCGAATAA TCTCAAGATG ATCAGAGCCG TCATCAACGG CAAGGAGATC GTCTTCAACG ACATGATGAA CGGCCTGATC GACACCGTGA CCAACCAGGG CATTTGCCAG GTCTGCCACA CTCAGACCTC CCACTACCGC GCCGGAATTC CCGAAACCGA TCACCCGACC TCGGGTTGCC TCTCCTGCCA CCCGCATGTC GGAGCCGAGG CCGCGTTCCT TCCACAGTCT CGGCGATACT AA
|
Protein sequence | MNSCRSLKCC NLILSLTMLF LVMGILSVSG QPAHASTRQY VMTCISCHKM PPLDSADGTR IPYTGALKGS HLGHASASTS SCAKCHRDDV ANYRTAHRNR LIEISPAINS VAGAAYSRGF FNQTSVPPAI LGTCSSVDCH FESTTPSWGT TVLKAPEDCS ACHGSAPADG NHPGSGQKHG VYYDTGTGSC AVCHPDHLGD AKPFAHATSA GSRALAVQFT TLPNKGGSYS RDLSYPNYLP SKSGLRNGSC LGLYCHSPGN KNSSFDPPNQ AATWGGTLNC AGCHKAGLAS GSVMTSGSHG KHVDGSVSSF ACSKCHFATA TSSMTIADVT QHVNGRVDIF FGVSTSAANG SYNGLISPVS KLPGSGYGAC SNVYCHSNGQ SEGGVGIAYR TPIWGSGTTG KCGSCHADGS GHNDAVPAMS SGSHKKHLSY TLLATSGPVR CTICHNVKGA KFTAYASCSQ MSCHSTGGAI KHSDQEIDVS LVSYFGGVYD GTRAPGDGYG ACANVYCHSN GQATPSYAPP VTWGAVTLRC DACHGSATSK GGSDTTTSLS GKHAAHVNNA LVLGAGKSLH CIDCHSITVS SDTTIASTAV HVNKMLNYTG HYAGGPRRYS STTKVCSNIY CHSSGQAKPV FRNMTGTKSW ASTGTLSCNG CHGYGPGTFA SVAGEPNYLN GGAGSGTANS HQKHMAGANL LDSRGCAKCH RSTADQGMAG KLRDYSSAHL NGSRDVSFAV LGNISGHYSA AAKTCSNTYC HGGGSIQWGG QGPLACNSCH GDAETLGTNA HARHISPSSG KAISCAICHA ATAAGNGSIA DGTIHADGKK DVVFSGAALG TQMDLTGNCS TSYCHSNGKG SYSTPNWSAN SSGACGTCHA TAPGLGSPLI ASGAHFSHFS TAATSYGPMF STGNVTGCQA CHDFGNELAS THIDQTVNVN SSLGYSTSGT CTPCHTKEVS WTGGAVSCES CHAGTLSVIN GVTASDKSQA ATRGHGGPTI GKGCTDCHER NARHINGGSR LRAQFSGGLN LECNYCHDDS SVLLDPDSRN MSTHVLVKGG TPAMECAQCH DPHGSNNLKM IRAVINGKEI VFNDMMNGLI DTVTNQGICQ VCHTQTSHYR AGIPETDHPT SGCLSCHPHV GAEAAFLPQS RRY
|
| |
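The TIGRFAM annotations refer to CXXCH domains, the heme-binding motif typical of c-type cytochromes (Cys, any two residues, Cys, His). As a rough illustration, the sketch below scans a protein string for that pattern. It assumes the sequence is pasted in the 10-residue blocked form used above, so whitespace is stripped first; the helper name `find_cxxch` is hypothetical, and a plain pattern match is only a screen, not a substitute for the TIGRFAM models.

```python
import re


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (0-based position, motif) for every CXXCH match, overlaps included."""
    seq = "".join(protein.split()).upper()  # drop the spaces between 10-residue blocks
    # The lookahead lets overlapping motifs be reported as separate hits.
    return [(m.start(), m.group(1)) for m in re.finditer(r"(?=(C..CH))", seq)]


# A short fragment copied from the protein sequence in this record.
fragment = "VMTCISCHKM PPLDSADGTR"
print(find_cxxch(fragment))  # [(3, 'CISCH')]
```

Passing the full protein sequence from the record to `find_cxxch` would list every candidate heme-binding site in the same way.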