Gene Information |
Locus tag | GM21_3443 |
Symbol | |
ID | 8138810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3979161 |
End bp | 3980702 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871059 |
Product | cytochrome c peroxidase |
Protein accession | YP_003023224 |
Protein GI | 253702035 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
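The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1542 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 3979161, End bp 3980702.
length = gene_length(3979161, 3980702)
print(length)                  # 1542
print(protein_length(length))  # 513
```

Both results agree with the Gene Length and Protein Length fields of the record.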
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 8.41254e-25 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAAAG ATGCCGCGCT CGTTTCCTCA CTTTTTCTTG CAGCAGCCCT GATCATCCCG GCGACCAAGT CGTGGACGCA GCCACAGGGA GAGACCGCCA GGAAGGCGCC CTATGACGCG GTCCAGGATG CGCAGACGCC CAGGGAGGTC GCAAGGACCC GCCCCGAGCA GCAAAAGGAC CAGCACGACA ACTCGGACCT GTTCACGGCC ACGAAGGCGG AACCCTCCTC GACGGCTTTT AAGAACCAGC CCGACGAGGG AAAGATCCTC GGTTTCGATT TCTACCGCGA TCCCCTCAAC GCCAAGAAAC CGATGACAAC CTTTCAGGAG GTCTACCAGA AGGACGTGGC CGAGAAGCCG AAGGTGATGG CGACCCAAAG GCGCCTTCTG GAGATGAGGT ACAACCTGAA GCCGAACCTC TCACCTGACG TGAAGATGAC CCGGGGCAAG CCTATCGCGG TAGGACCGAC CGCGCTCCTT GCCCAAGGAA CTACCTGGGA GAAGTTGTCG GCGATGCCGC CCGAGCAGAT CAGGTCGGGG AACCTCTTCC CTTACCCCCC TCTGCCCCAT CCCAAGCAGG TCAACGGCGG GCAGGTCTTC CCCCAGATCC AGATCGACAT GTTCCCGAGG CTGCAGCGCT TCGACGTCGA TTTCGACCTA CCCGATGCCT TCCTCCCCGA GTTCCCCCCC GCCATCTTCC TGCAGAACCG CCCAGAACTG GGAGACGTTT CCCGCGGCGA GGTGGTCAGC ATCAACAACT TCTACCGCCT CTTCAAGGAC CTCCTCACCC CGGTGCAGTT GGACGGCTTG CGGATGCTGG TGACCCCCTT CCCGCAGGAA GAGTTCAACC CCACCGACGA CCGCAAATCC CCGCAGGCCA GCCTCGGGGT CGCCTGCCTC GACTGCCACG TCAACGGGCA CACCACCGCT CAGTTCCACC TGAGCCCCGA CATCCGTCCC CAGGAGCGCC GCTTCCGGCT CGACACGACC AGCCTGAGGG GGCTATATAA CCAGCAGATC CACTCCTCCA AGCGCAGCCT GCGCTCGGTC GAGGATTTTA CCGAATTCGA GCAGCGCACC GCCTACTTCA ACGGGGACGA AATCCACGCC GCCAAAAAAG GGATGAACAT CCTGAGCCGG GTCCAGGTCA GCCACATGGC CCAGATGCAG AACATGTTCG ACGTACCTCC CGCACCCAAG CTCGACCCTG CCGGTTACCT GGCCCCCATG AAGGCCACCC CGGCGGAAAT AGCGGGTCAG AAGATCTTCT TCGGGAAGGG TAGATGCGGC ACCTGCCACC CCGCCCCGTT CTACCTGGAT CACCAGATGC ACGACCTGCA GATGGAGCGC TTCACCCGCG AGCCGGGCGA CGGCCCCATC AAAACCTTCA CCCTAAGGGG GATCAAGGAA AGCCCCCCGT ACATGCATGA CGGTCGTTGC CTCACCCTGG AGGACACGGT GAAGTTCTTC AACCTGGTGC TCGGGCTCAA ACTTTCCGCG GAGGAGGAGA CCAACCTGGT CGCCTTCCTG CGGGTGCTCT AG
|
Protein sequence | MRKDAALVSS LFLAAALIIP ATKSWTQPQG ETARKAPYDA VQDAQTPREV ARTRPEQQKD QHDNSDLFTA TKAEPSSTAF KNQPDEGKIL GFDFYRDPLN AKKPMTTFQE VYQKDVAEKP KVMATQRRLL EMRYNLKPNL SPDVKMTRGK PIAVGPTALL AQGTTWEKLS AMPPEQIRSG NLFPYPPLPH PKQVNGGQVF PQIQIDMFPR LQRFDVDFDL PDAFLPEFPP AIFLQNRPEL GDVSRGEVVS INNFYRLFKD LLTPVQLDGL RMLVTPFPQE EFNPTDDRKS PQASLGVACL DCHVNGHTTA QFHLSPDIRP QERRFRLDTT SLRGLYNQQI HSSKRSLRSV EDFTEFEQRT AYFNGDEIHA AKKGMNILSR VQVSHMAQMQ NMFDVPPAPK LDPAGYLAPM KATPAEIAGQ KIFFGKGRCG TCHPAPFYLD HQMHDLQMER FTREPGDGPI KTFTLRGIKE SPPYMHDGRC LTLEDTVKFF NLVLGLKLSA EEETNLVAFL RVL
|
| |
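The sequences above are displayed in space-separated groups of 10 characters, so they need light cleaning before any analysis. The sketch below shows one way to strip the grouping, compute a GC fraction, and split a coding sequence into codons. The helper names are hypothetical, and the example input is only the first two display groups of the gene sequence, so its GC fraction is not the whole-gene value reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace separating display groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


def split_codons(dna: str) -> list[str]:
    """Split a sequence into complete codons, ignoring any trailing 1-2 bases."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# First two display groups of the gene sequence in this record.
fragment = clean_sequence("ATGAGAAAAG ATGCCGCGCT")
print(gc_fraction(fragment))   # 0.5
print(split_codons(fragment))  # ['ATG', 'AGA', 'AAA', 'GAT', 'GCC', 'GCG']
```

Under the standard bacterial code (translation table 11), these six codons correspond to the first six residues of the protein sequence, MRKDAA.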