Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0253 |
Symbol | pepN |
ID | 8135560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 301627 |
End bp | 304269 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644867874 |
Product | aminopeptidase N |
Protein accession | YP_003020096 |
Protein GI | 253698907 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.000158093 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATACCT GCCAGCACCA GACCGTTTAC CAGAAAGATT ATTCCGCGCC TGACTACCTC GTTGAGACAG TTGAGCTCTC TTTCGACCTC GACCCCGAAC TGACCCGGGT CGCGTCCCGG CTCAAGATCC GCTCCAACTA CGACCGGGCG CAAGGCGTGC GGCCGCTGGT TTTGGACGGA GAGGAGCTGA CCCTCGTGTC GCTCAAGCTG GACGGGGTCG AACTGGAGCA GAACCGCTAT CAGGCGGTGG ACGGCGCCCT CACCGTGACC GAACCGCCGG AGAGCTTCCT GCTGGAGGTG ACCACGCGGA TAAGCCCCAA GGCGAACAGC GCGCTCTCCG GGCTCTACGC CTCCGGCCCC ATGCTCTGCA CCCAGTGCGA GGCCGAGGGT TTCCGCCGCA TCACCTACTT CACCGACCGC CCCGACGTCA TGGCGGTCTA CACCGTCACC CTGAAAGCCG ACAAGGAGTC GTGCCCGGTG CTTTTGGCCA ACGGCAACCT GGTGGAAAAA GGGGATCTCG CCGACGGGCG GCATTTCGCC ACCTGGCACG ACCCGTTCAA AAAGCCGAGT TACCTCTTCG CCGTGGTGGC GGGGGACCTG GTCCATATCT CGGACCGCTT CACCACCATG AGCGGAAGGC CTGTGAACCT GGAGATCTAC GTCGAGGAAA AGAACCGGGG AAAGTGCGAC CACGCGCTCA GGTCGCTCAT CGAGGCCATG CGCTGGGACG AGGAGCAGTT CGGCCGCGAG TACGACCTGG ATACCTACAT GATCGTCGCC GTGGACGATT TCAACATGGG GGCAATGGAG AACAAGGGGT TGAACGTCTT CAACTCGCGC TACGTCCTGG CGAGCCCCGA GACCGCCACC GACGACGACT ACCAGGCCAT CGAAGAGGTG ATCGGACACG AATATTTCCA CAACTGGACC GGCAACCGGA TCACCTGCCG CGACTGGTTC CAGCTCTCCT TAAAGGAAGG GCTCACCATC TTCCGTGACC AGGAATTCTC CGCCGACATG CAGTCGCGCC CGGTGAAGAG GATCGCCGAC GTGAGGCTCT TGCGCTCGTC CCAGTTCCCC GAGGACGCGG GTCCCCTGGC CCACCCGGTC CGCCCCGACT CCTACGTGGA GATCAACAAC TTCTACAGCA TGACGGTCTA CCACAAGGGG GGCGAGGTGA TCAGGATGCT GCAGACCCTC CTGGGGCGCG AGGCGTTCCG CGCCGGGATG GACCTCTACT TCGAGCGGCA CGACGGCCAG GCGGTCCGGG TGGACGAATT CGTCCAGGCC ATGGCGGACG CAGGTAAGCG CGACCTCTCC CAGTTCATGC GCTGGTACAA CCAGTCAGGC ACCCCGGTCC TTACCGTGAG CGACGAGTTC GACGCGGCCA GCGGGGTCTA CACGCTGACC GTGACGCAGA GCTGCCCCGC GACGCCGGGG CAGCCCGAGA AGGAGCCGTT CCACATACCG CTGGCCGTAG GGCTCATGAC CCGCGACGGG CGGGAGCTGC CGCTGCAGCT TGAGGGAGAG AAGAGCCGGG GAGCTTCCAC CAGGGTACTG GAGCTGCGCC GGGAGACGGA GAGTTTCCGG TTCACCGGGA TGGTCTCCAA GCCGGTGCCG TCTCTCTTAC GGAACTTCTC CGCGCCGGTG AAGCTCGTGT ACCCCTACAG CGAGGCCGAC CTCACCCTTT TGATGACGAG CGACAGCGAC CCCTTCGTGC GCTGGGAGGC GGGGCAGGTG CAGGCGGTGC AGGTGATCAT GGGGCTGGTG CGGGAGATCC AGGCGGGGGG GACTCCGACG GTGCCGGAAG CCGTCATCGG CTCCTTCGGC ACACTCCTTA CCGACGAGCG GCAGGACCGC GCCTTCCTGG CCGAGGCGCT CACGCTCCCC TCCGAGGGCT ATCTCGCCGA GCAGATGGAG GTGATCGACC CGACAGCCAT CCACGAGGCG CGGGAGCTGG TGCGCGCGAA GGTGGGCGAG CGGCTGCGGG AAGAGCTGGT GGCGGCGCGC GCGGCGTGCG CCCCCAACTC CCCCTACCAC CCCGACGACG GCCTCGCCGG TTGCCGCAGG CTGAAGAACC TCTGCCTCTC TTACCTGATG GCGCCGGGAT CCCGCGAGGC GGTCGGCATG GCCATGGAGC AGTTCAAAAA CGCCGACAAC ATGACCGACA GCCTGGGCGC CCTTGCCGCG CTGGCCGGCT GCGACTGCCC CGAGCGCGAG GAGGCGCTGG AGGCCTTCTA CCGGAAATGG CGCGATGACC GCGGCGTCAT CGACAAGTGG TTCAGCCTGC AGGCGACGTC CCGTCTGCCG CAGACGCTCG ACCGGGTCCT CGAGCTTTTG GAGCACCCCG ACTTCGACAT CCGCAACCCC AACCGCGTCC GCTCGCTGGT GGGCGCCTTC AGCCAAGCGA ACCAGGTACG CTTCCACGAC CCTGAAGGGA GGGGGTACCG CTTCCTGGGC GACCAGATCC TGCGCCTGAA CGCCATCAAC CCGCAGATCG CCGCCCGCAT GCTGACCCCG TTCAGCCGCT GGCGGCGCCT CGACGCGGGG AGGCAGGAGC TGATGAAGAA AGAGCTGGAA CGTATCCTCG CCGAGCCGGG GCTGGCGCGG GACGTCTACG AGCTCGCGGC GAAGAGCTTG TAA
|
Protein sequence | MHTCQHQTVY QKDYSAPDYL VETVELSFDL DPELTRVASR LKIRSNYDRA QGVRPLVLDG EELTLVSLKL DGVELEQNRY QAVDGALTVT EPPESFLLEV TTRISPKANS ALSGLYASGP MLCTQCEAEG FRRITYFTDR PDVMAVYTVT LKADKESCPV LLANGNLVEK GDLADGRHFA TWHDPFKKPS YLFAVVAGDL VHISDRFTTM SGRPVNLEIY VEEKNRGKCD HALRSLIEAM RWDEEQFGRE YDLDTYMIVA VDDFNMGAME NKGLNVFNSR YVLASPETAT DDDYQAIEEV IGHEYFHNWT GNRITCRDWF QLSLKEGLTI FRDQEFSADM QSRPVKRIAD VRLLRSSQFP EDAGPLAHPV RPDSYVEINN FYSMTVYHKG GEVIRMLQTL LGREAFRAGM DLYFERHDGQ AVRVDEFVQA MADAGKRDLS QFMRWYNQSG TPVLTVSDEF DAASGVYTLT VTQSCPATPG QPEKEPFHIP LAVGLMTRDG RELPLQLEGE KSRGASTRVL ELRRETESFR FTGMVSKPVP SLLRNFSAPV KLVYPYSEAD LTLLMTSDSD PFVRWEAGQV QAVQVIMGLV REIQAGGTPT VPEAVIGSFG TLLTDERQDR AFLAEALTLP SEGYLAEQME VIDPTAIHEA RELVRAKVGE RLREELVAAR AACAPNSPYH PDDGLAGCRR LKNLCLSYLM APGSREAVGM AMEQFKNADN MTDSLGALAA LAGCDCPERE EALEAFYRKW RDDRGVIDKW FSLQATSRLP QTLDRVLELL EHPDFDIRNP NRVRSLVGAF SQANQVRFHD PEGRGYRFLG DQILRLNAIN PQIAARMLTP FSRWRRLDAG RQELMKKELE RILAEPGLAR DVYELAAKSL
|
| |