Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3158 |
Symbol | |
ID | 8138510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3667337 |
End bp | 3669130 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870763 |
Product | cytochrome C family protein |
Protein accession | YP_003022943 |
Protein GI | 253701754 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 118 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TCGAATCTGG CATATACGTA CTTCTCGCGC TGGCGGCAGC CTTCATGGCA GTCCCCTTTG CGTCTGCCGT CGAGAAGCCG CACAGCAGGG ACATCATCCA GACCCCGCAC AACCTCTCCA TCACCGGTGG AGGAGGGGCG CACGACATAA AGTCCGGCAC GGAGGCCGAG GTCTGCATCT TCTGCCACGC CCCGCACCAT GCCTCGACCG TCACCCCGCT TTGGAGCCGG GAGATATCCC CGCTGACCAT CTACGTCACC TACAAGTCGC CCACCCTCAA GGCGAACCCG CAGCAGCCGC GGGGGGCCTC GCGCCTGTGC CTCTCCTGCC ACGACGGCAC CATCGCGCTC GGGCACCTAA CCGGGGACAG GATCCTCGAC GCCTCTTTGC CTGCATTCAA GGACATGCCC CAGGAGACCG ACCCCCGCAA AAACCCGAAC CTGGGGACCG ACCTCTCCAA CGATCATCCT ATTTCCTTCT TGTACTCCGA GGCCGGCAAC CTGGAACTGC ACGACGCAAC AGCGGTTCAG GCCAAGGGAG TCAGGTTGTC CCAGGATCAG TACGTCGAGT GCACCTCCTG CCACGATGCG CACAACAACC AGTACGGCAA TTTCCTGGTG CAGGACGTGA CCTTGCAGCA GGACGCCCTG TGCACCACTT GCCACAACAA GCAGGGGTGG AGCGAGCCGG ACAGCACGCA CCGCACCGGC GGCAGCCGCT ATGACACCGT CACTGCCGGT GTCGCGGCAT CGGGCTGCAT CAACTGCCAT TTGCCGCACA ACGCGCAAAG AGGCGAGCAC CTGCTGAGAC TTTCGGGGGT GGGGGCCGGA GAGGAAACCA ACTGCTACAC CTCCTGCCAC CAAAACGTCC CGTACTCGAA CGTATGGAGC CAGTTCAACA CCTCCCTCTA CACCCATCGT GTCCAGAACT ACAACGGCGT CCATGTCGAC AACGAGAGTC TGCCGGTGGT CGGCGGCAAA AAGCACGTCG AGTGCACGGA CTGCCACAAC CCGCACTTCG CCGGGGCCCA GGGTCTGCCG CTTGGCAGTT CCACCCCCCT GGTCCCGCCT GCCTCCGCCG CGCCCGACAT CAACGGCGCG CTGCGCGGAG TGCGCGGTGT CGACCTCACG GGCGCCGCGG TGGTTTCCCC TGCGCGTTAT GAGTATGAGG TCTGCTACCG CTGCCATGCC GGACCCAGTG CCGACCAGTA CACCAGCCTG GCCCAAATGC TCCCCAATCG CCTCTTCAAG GATTACGACG AGAGCAACCG GTTCAATTCT TCCAATGCGG CATACCATCC GGTGTCGGCG GATCGCCGTC CGGGTCCCAA CGGCCGCAGT CTGCGCAGCC AGTACCAGAG CACGATGTTC CGCATCTATT GCAACGACTG CCATGATTCC CACGGCACCA ATGAGCCGCA CATGCTGCGT TACCTGAACC AGGACACCTT CCCGGCCACG GGAGGCACCA ACTACCCGCT TTGCTTCCGT TGCCACGACC CCGATTACCT GCTCAACCCG GTGGGGGCTC CTAGCTCGGA TACCGCTGTC CTGCACCAGA GACACGTACT GGGCCAGCAC CTGAACGGTG ACACGCGGCA AACCCCGTGC TCCGTCTGCC ACGACCCCCA CGGCGTTCCG GCTACTCGCG GCGCGCTATC CAGCAACGCC GCGCACTTGG TGAACTTCGA CGTGCGTTAT GCAGGAGAAA CGGCAGTATA CGACGCTGTT GCCAGGACCT GCGCCGTAAT ATGCCACACC AGCAACCCCA AGTCGTACCC ATAG
|
Protein sequence | MKKVESGIYV LLALAAAFMA VPFASAVEKP HSRDIIQTPH NLSITGGGGA HDIKSGTEAE VCIFCHAPHH ASTVTPLWSR EISPLTIYVT YKSPTLKANP QQPRGASRLC LSCHDGTIAL GHLTGDRILD ASLPAFKDMP QETDPRKNPN LGTDLSNDHP ISFLYSEAGN LELHDATAVQ AKGVRLSQDQ YVECTSCHDA HNNQYGNFLV QDVTLQQDAL CTTCHNKQGW SEPDSTHRTG GSRYDTVTAG VAASGCINCH LPHNAQRGEH LLRLSGVGAG EETNCYTSCH QNVPYSNVWS QFNTSLYTHR VQNYNGVHVD NESLPVVGGK KHVECTDCHN PHFAGAQGLP LGSSTPLVPP ASAAPDINGA LRGVRGVDLT GAAVVSPARY EYEVCYRCHA GPSADQYTSL AQMLPNRLFK DYDESNRFNS SNAAYHPVSA DRRPGPNGRS LRSQYQSTMF RIYCNDCHDS HGTNEPHMLR YLNQDTFPAT GGTNYPLCFR CHDPDYLLNP VGAPSSDTAV LHQRHVLGQH LNGDTRQTPC SVCHDPHGVP ATRGALSSNA AHLVNFDVRY AGETAVYDAV ARTCAVICHT SNPKSYP
|
| |