Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2973 |
Symbol | |
ID | 8138316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3454263 |
End bp | 3455366 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644870571 |
Product | peptidase M29 aminopeptidase II |
Protein accession | YP_003022760 |
Protein GI | 253701571 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.000000000872397 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGATC CTCGCATAAG ACAATTCGCT GAAGTCCTGG TCGATTACTC GGCACGCGTG CAGAAAGGCG ACGTGGTGCT CATCTCCTGC GCCGGCACCG AAGGCGCGCC TTTGGTCAAG GAACTCTACG CGCTCTGCCT GGAGCGGGGG GCCAAGTACG TCGAGTACGA GTTCACCATT CCCGACATCA ACCGCTACTT CTACAACCTA GCCAGCGCCG ACCAGCTCTC CTACTTCCCC CAGCACAAGC TGGACTTCAT GAAGCAGGCG GACGTCTACA TCGGGCTCAC CGCGTCGGAC AACTCGATGG TGATGGCTAA GGCGAAGCAG GCCAGCATGA TCGCCTGGTC CAAGGTGGTG CGCCCCATCA TCGACCAGCG CGTCAAGCAC ACCCGCTGGG TCATCACCCG CTATCCGACC CAGTCGGCGG CCCAGGAAGC GCGCATGAGC CTCGACGAGT ACGAGGATTA CCTCTTCGCG GCCTGCTGCA TGGACTGGGA GGAGGAGTCG AGGAAGCAGG ACGCGCTCAA GGCGTGCGTG GACGCGGCGG ACCGGGTGCG CATCAAGGCC TCCGACACGG ATCTTTGCTT CAGCATCAAG GGGCTTCCCG GCATCAAGTG CGACGGGCGC CTCAACATCC CCGACGGCGA GGTCTTCACC GCGCCGGTGC GCGACTCGGT GCAGGGATAC ATCACCTACA ACTGCCCCAC CGTGTACCAG GGTAAGGAAT TCAACAACAT CCGCCTGGAG TTCGAGAACG GCCGCATCGT CCGCGCCAAC TCGCCTGGAA TGGACGAGGA GCTGAACCGG ATCCTCGACA CCGACGACGG GGCGCGTTAC GTCGGCGAAT TCGCCATCGG CGTCAACCCG AAGATCACGG TGCCTATGCG CAACATCCTT TTCGACGAGA AGATCTTCGG CTCCATCCAC TTCACGCCCG GGCAGGCCTA CGACGAGTGC GACAACGGCA ACCGCTCCGC GGTGCACTGG GACATGGTGA AGATACTGGC CGGCGACGGC GAGCTTTGGT TCGACGAGAT CCTGATCCAG AAGGACGGAC TCTTCGTCCA CGAGCCCCTG CTCGGTCTCA ACCCGGGAGC TTAG
|
Protein sequence | MKDPRIRQFA EVLVDYSARV QKGDVVLISC AGTEGAPLVK ELYALCLERG AKYVEYEFTI PDINRYFYNL ASADQLSYFP QHKLDFMKQA DVYIGLTASD NSMVMAKAKQ ASMIAWSKVV RPIIDQRVKH TRWVITRYPT QSAAQEARMS LDEYEDYLFA ACCMDWEEES RKQDALKACV DAADRVRIKA SDTDLCFSIK GLPGIKCDGR LNIPDGEVFT APVRDSVQGY ITYNCPTVYQ GKEFNNIRLE FENGRIVRAN SPGMDEELNR ILDTDDGARY VGEFAIGVNP KITVPMRNIL FDEKIFGSIH FTPGQAYDEC DNGNRSAVHW DMVKILAGDG ELWFDEILIQ KDGLFVHEPL LGLNPGA
|
| |