Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1291 |
Symbol | |
ID | 8136618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1511356 |
End bp | 1512819 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868905 |
Product | Di-heme cytochrome c peroxidase |
Protein accession | YP_003021109 |
Protein GI | 253699920 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | [TIGR02953] pentapeptide MXKDX repeat protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 1.5340600000000002e-32 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTTCAA AACTCGTACT GACGACGCTG TCTGTAGTGT TGGTAGCTGT ACCGGTAGCG GCGGCGGAAC TTACCCCGCT GGAAAGCTTG GGCAAGAATC TCTTTTTCGA CCAGTCTCTT TCCAATCCTC CAGGTCAGGC CTGTGTAGAT TGCCATAGCC CTGAAACTGG GTGGACCGGA CCGGATTCGG AGATCAACGC CGCAGGTGCG GCTATTCCTG GAGCGGTCCA CACGAGGGCG GGCAACAGGA AACCTCCGAC CGCGGCCTAT GCCGGATACA ACCCGGTACT GCACAAAGCC GGCTCCATGG GCGGCGGTGG CATGGGCGGT GGTGGCATGG GCGGTGGTGG CATGGGCGGC GGCGGCATGG GCGGAGGCGG TATGGGCGGA GGCGGTATGG GCGGTGGCGG CATGGGCGGT GGCGGCATGG GCGGTGGCGG CATGGGTGGT GGCGGCATGG GTGGCAACAT GCAGGATGTA TTCGTGGGCG GGATGTTCTG GGATGGCCGG GGCACGGGTT GGGAGATGGG GGATCCGCTT GCCGAACAGG CGATGGGTCC TTTCCTGAAC CCGCTGGAAC AGAACAATCC CAACGCGAAG CATGTCTGTC TCAACGTTCT GAGGACCGGG TACGCGACGC AGTTCGAAGA GGTTTGGGGA GCAGGTTCCC TGGACTGCGT GAAGGACGTT GACGGCACCT ACCAGCGCAT TGGCCGCTCT ATCGCTGCCT ACGAGCGTTC CGCCGAGGTT AGCGCGTTCA ACTCCAAGTT CGATACCTTC TGGAAGAACT CGGAGGGGAA GATGCCGCCG GTTCCCATGA TCAACATGAT GAACTGGACC CGGTTCAAAA AACGCGGCCT GACGGACATG GAGCTGCAGG GTCTGATGAT CTTCAACACC AAGGGGAAGT GCTCCACCTG TCACTTCTTG CAACCGATGA ACGGAAGCCG GTTCCCGCTT TTCACCGATT TCAGGTACCA CAACCTGGGC GTGCCCGCCA ATCCTGAGAA CCCGTACTAC GACATGCCGC GCCAGTGGAA CCCGAAAGGG GAGAACTGGG TCGATCAGGG GCTGGGCGGA TTTTTGGCCA AGACGGCGGT GATGACGGAC AGCGCCGGCG TTTCCATGGA CTACAGCGCG CTGGCGGCTC AGAACATGGG CAAACAGAGG ACGCCGACGC TGCGCAACGT GGACAAGCGT CCGGGACCGG ACTTCGTCAA GGCTTTCGGC CACAACGGCC ATTTCAAAAC TCTGCAGGAG ATCGTGCACT TCTATAACTT GAGGGACGTC CTTCCGATTT GCGACACCCC CAACCCGCCC AAGGACGCCA TGGGTGGCGC AACCTGCTTC CCTCCTCCCG AAGTGGCGGA AAACATCAAC AGGGTAGACA TGGGGAACCT CGGGCTGACC CCCCAAGAGG GAATGGCGCT GATCCAGTTT ATGAAGACCC TGAACGATCT GTAA
|
Protein sequence | MISKLVLTTL SVVLVAVPVA AAELTPLESL GKNLFFDQSL SNPPGQACVD CHSPETGWTG PDSEINAAGA AIPGAVHTRA GNRKPPTAAY AGYNPVLHKA GSMGGGGMGG GGMGGGGMGG GGMGGGGMGG GGMGGGGMGG GGMGGGGMGG GGMGGNMQDV FVGGMFWDGR GTGWEMGDPL AEQAMGPFLN PLEQNNPNAK HVCLNVLRTG YATQFEEVWG AGSLDCVKDV DGTYQRIGRS IAAYERSAEV SAFNSKFDTF WKNSEGKMPP VPMINMMNWT RFKKRGLTDM ELQGLMIFNT KGKCSTCHFL QPMNGSRFPL FTDFRYHNLG VPANPENPYY DMPRQWNPKG ENWVDQGLGG FLAKTAVMTD SAGVSMDYSA LAAQNMGKQR TPTLRNVDKR PGPDFVKAFG HNGHFKTLQE IVHFYNLRDV LPICDTPNPP KDAMGGATCF PPPEVAENIN RVDMGNLGLT PQEGMALIQF MKTLNDL
|
| |