Gene GM21_0253 details

Gene Information

Locus tag: GM21_0253
Symbol: pepN
ID: 8135560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 301627
End bp: 304269
Gene Length: 2643 bp
Protein Length: 880 aa
Translation table: 11
GC content: 66%
IMG OID: 644867874
Product: aminopeptidase N
Protein accession: YP_003020096
Protein GI: 253698907
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
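
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the stated length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(301627, 304269)
print(length)                     # 2643
print(protein_length_aa(length))  # 880
```

Both values agree with the Gene Length and Protein Length fields of this record.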


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.000158093
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATACCT GCCAGCACCA GACCGTTTAC CAGAAAGATT ATTCCGCGCC TGACTACCTC 
GTTGAGACAG TTGAGCTCTC TTTCGACCTC GACCCCGAAC TGACCCGGGT CGCGTCCCGG
CTCAAGATCC GCTCCAACTA CGACCGGGCG CAAGGCGTGC GGCCGCTGGT TTTGGACGGA
GAGGAGCTGA CCCTCGTGTC GCTCAAGCTG GACGGGGTCG AACTGGAGCA GAACCGCTAT
CAGGCGGTGG ACGGCGCCCT CACCGTGACC GAACCGCCGG AGAGCTTCCT GCTGGAGGTG
ACCACGCGGA TAAGCCCCAA GGCGAACAGC GCGCTCTCCG GGCTCTACGC CTCCGGCCCC
ATGCTCTGCA CCCAGTGCGA GGCCGAGGGT TTCCGCCGCA TCACCTACTT CACCGACCGC
CCCGACGTCA TGGCGGTCTA CACCGTCACC CTGAAAGCCG ACAAGGAGTC GTGCCCGGTG
CTTTTGGCCA ACGGCAACCT GGTGGAAAAA GGGGATCTCG CCGACGGGCG GCATTTCGCC
ACCTGGCACG ACCCGTTCAA AAAGCCGAGT TACCTCTTCG CCGTGGTGGC GGGGGACCTG
GTCCATATCT CGGACCGCTT CACCACCATG AGCGGAAGGC CTGTGAACCT GGAGATCTAC
GTCGAGGAAA AGAACCGGGG AAAGTGCGAC CACGCGCTCA GGTCGCTCAT CGAGGCCATG
CGCTGGGACG AGGAGCAGTT CGGCCGCGAG TACGACCTGG ATACCTACAT GATCGTCGCC
GTGGACGATT TCAACATGGG GGCAATGGAG AACAAGGGGT TGAACGTCTT CAACTCGCGC
TACGTCCTGG CGAGCCCCGA GACCGCCACC GACGACGACT ACCAGGCCAT CGAAGAGGTG
ATCGGACACG AATATTTCCA CAACTGGACC GGCAACCGGA TCACCTGCCG CGACTGGTTC
CAGCTCTCCT TAAAGGAAGG GCTCACCATC TTCCGTGACC AGGAATTCTC CGCCGACATG
CAGTCGCGCC CGGTGAAGAG GATCGCCGAC GTGAGGCTCT TGCGCTCGTC CCAGTTCCCC
GAGGACGCGG GTCCCCTGGC CCACCCGGTC CGCCCCGACT CCTACGTGGA GATCAACAAC
TTCTACAGCA TGACGGTCTA CCACAAGGGG GGCGAGGTGA TCAGGATGCT GCAGACCCTC
CTGGGGCGCG AGGCGTTCCG CGCCGGGATG GACCTCTACT TCGAGCGGCA CGACGGCCAG
GCGGTCCGGG TGGACGAATT CGTCCAGGCC ATGGCGGACG CAGGTAAGCG CGACCTCTCC
CAGTTCATGC GCTGGTACAA CCAGTCAGGC ACCCCGGTCC TTACCGTGAG CGACGAGTTC
GACGCGGCCA GCGGGGTCTA CACGCTGACC GTGACGCAGA GCTGCCCCGC GACGCCGGGG
CAGCCCGAGA AGGAGCCGTT CCACATACCG CTGGCCGTAG GGCTCATGAC CCGCGACGGG
CGGGAGCTGC CGCTGCAGCT TGAGGGAGAG AAGAGCCGGG GAGCTTCCAC CAGGGTACTG
GAGCTGCGCC GGGAGACGGA GAGTTTCCGG TTCACCGGGA TGGTCTCCAA GCCGGTGCCG
TCTCTCTTAC GGAACTTCTC CGCGCCGGTG AAGCTCGTGT ACCCCTACAG CGAGGCCGAC
CTCACCCTTT TGATGACGAG CGACAGCGAC CCCTTCGTGC GCTGGGAGGC GGGGCAGGTG
CAGGCGGTGC AGGTGATCAT GGGGCTGGTG CGGGAGATCC AGGCGGGGGG GACTCCGACG
GTGCCGGAAG CCGTCATCGG CTCCTTCGGC ACACTCCTTA CCGACGAGCG GCAGGACCGC
GCCTTCCTGG CCGAGGCGCT CACGCTCCCC TCCGAGGGCT ATCTCGCCGA GCAGATGGAG
GTGATCGACC CGACAGCCAT CCACGAGGCG CGGGAGCTGG TGCGCGCGAA GGTGGGCGAG
CGGCTGCGGG AAGAGCTGGT GGCGGCGCGC GCGGCGTGCG CCCCCAACTC CCCCTACCAC
CCCGACGACG GCCTCGCCGG TTGCCGCAGG CTGAAGAACC TCTGCCTCTC TTACCTGATG
GCGCCGGGAT CCCGCGAGGC GGTCGGCATG GCCATGGAGC AGTTCAAAAA CGCCGACAAC
ATGACCGACA GCCTGGGCGC CCTTGCCGCG CTGGCCGGCT GCGACTGCCC CGAGCGCGAG
GAGGCGCTGG AGGCCTTCTA CCGGAAATGG CGCGATGACC GCGGCGTCAT CGACAAGTGG
TTCAGCCTGC AGGCGACGTC CCGTCTGCCG CAGACGCTCG ACCGGGTCCT CGAGCTTTTG
GAGCACCCCG ACTTCGACAT CCGCAACCCC AACCGCGTCC GCTCGCTGGT GGGCGCCTTC
AGCCAAGCGA ACCAGGTACG CTTCCACGAC CCTGAAGGGA GGGGGTACCG CTTCCTGGGC
GACCAGATCC TGCGCCTGAA CGCCATCAAC CCGCAGATCG CCGCCCGCAT GCTGACCCCG
TTCAGCCGCT GGCGGCGCCT CGACGCGGGG AGGCAGGAGC TGATGAAGAA AGAGCTGGAA
CGTATCCTCG CCGAGCCGGG GCTGGCGCGG GACGTCTACG AGCTCGCGGC GAAGAGCTTG
TAA
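
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example call uses only the first printed line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence only (not the full gene).
first_line = "ATGCATACCT GCCAGCACCA GACCGTTTAC CAGAAAGATT ATTCCGCGCC TGACTACCTC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq), 1))
```

Passing the full sequence block to the same two functions gives values that can be compared with the record's stated length and GC content.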
 
Protein sequence
MHTCQHQTVY QKDYSAPDYL VETVELSFDL DPELTRVASR LKIRSNYDRA QGVRPLVLDG 
EELTLVSLKL DGVELEQNRY QAVDGALTVT EPPESFLLEV TTRISPKANS ALSGLYASGP
MLCTQCEAEG FRRITYFTDR PDVMAVYTVT LKADKESCPV LLANGNLVEK GDLADGRHFA
TWHDPFKKPS YLFAVVAGDL VHISDRFTTM SGRPVNLEIY VEEKNRGKCD HALRSLIEAM
RWDEEQFGRE YDLDTYMIVA VDDFNMGAME NKGLNVFNSR YVLASPETAT DDDYQAIEEV
IGHEYFHNWT GNRITCRDWF QLSLKEGLTI FRDQEFSADM QSRPVKRIAD VRLLRSSQFP
EDAGPLAHPV RPDSYVEINN FYSMTVYHKG GEVIRMLQTL LGREAFRAGM DLYFERHDGQ
AVRVDEFVQA MADAGKRDLS QFMRWYNQSG TPVLTVSDEF DAASGVYTLT VTQSCPATPG
QPEKEPFHIP LAVGLMTRDG RELPLQLEGE KSRGASTRVL ELRRETESFR FTGMVSKPVP
SLLRNFSAPV KLVYPYSEAD LTLLMTSDSD PFVRWEAGQV QAVQVIMGLV REIQAGGTPT
VPEAVIGSFG TLLTDERQDR AFLAEALTLP SEGYLAEQME VIDPTAIHEA RELVRAKVGE
RLREELVAAR AACAPNSPYH PDDGLAGCRR LKNLCLSYLM APGSREAVGM AMEQFKNADN
MTDSLGALAA LAGCDCPERE EALEAFYRKW RDDRGVIDKW FSLQATSRLP QTLDRVLELL
EHPDFDIRNP NRVRSLVGAF SQANQVRFHD PEGRGYRFLG DQILRLNAIN PQIAARMLTP
FSRWRRLDAG RQELMKKELE RILAEPGLAR DVYELAAKSL
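
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional T, C, A, G base ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so one table serves here. `translate` is a hypothetical helper that stops at the first stop codon, and the two inputs are the first thirty and last twelve bases of the gene sequence in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCATACCT GCCAGCACCA GACCGTTTAC"))  # MHTCQHQTVY
print(translate("AAGAGCTTGT AA"))                     # KSL
```

The first call reproduces the first ten residues of the protein sequence, and the second reproduces its last three residues followed by the TAA stop codon.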