Gene GM21_2487 details

Gene Information

Locus tag: GM21_2487
Symbol:
ID: 8137828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2909043
End bp: 2911112
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 55%
IMG OID: 644870096
Product: Peptidase M1 membrane alanine aminopeptidase
Protein accession: YP_003022287
Protein GI: 253701098
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
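
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2909043, 2911112)
print(length_bp, protein_length(length_bp))  # 2070 689
```

Both results agree with the Gene Length and Protein Length fields listed in the record.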


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 108
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGCGG GGGGGCGTGT GAAAATCCTA TTGATATGCA GTGCCATGGT TCTTATTGCA 
GTAGCAGGGA GAGGGGTTGC CGGAGGTGCC GAGTCTGGTC CACGGGTGAA GGTGCAGGAG
ATTTCCATTT CGCTGGAGCC TGAACGTCAC TTGGTGGTGG GGCAGAGCAG CATCGTTTTC
GAACATGGTG CGGAGAGGGT CTCGTTACGG CTGGCTGAAA CTGCGCGGGT AGAGTCCGCT
AGGGCCGCGG GCAAAGAGAT CCCTTTTTCC TTTGCGAGAG GGGTGTTGGT TCTTGAACTG
CCTCAAGGCG ACAGCGTTAC CATAACCGTC TCCTATCGGG CCGAATTCAA CGACCCCGTT
TCCCGAAACC CGGCGGCAGC TGAGGATCCA AGTTACGGCG TCAGTGCTGC GATCACATCT
AAGGGAACTT TTCTTGGTGG CGGATCCCAT TGGTATCCAG TCCCGTCACA GGTGCCTCTG
AGTCGCAAAA TCAGCATAAT CGCCCCGGCG GGTATCGAGG GCATCACTAA CGGGGCCAGG
ATACTCAGAG AAACATCTGG GGGAGTGACG AAGTCTGTGT GGCAGGAGTC GCGGCCGGTT
GGCGTGCCAT CGGTGAGTGC TGGGCCGTAC CTGGTTGAGG AGAGGCAGGC GGCCGGTGTC
CGGCTTTACA GCTATCTCTA CAGGGACAAT GCCAATTTGG CGCCTCGGTA CTTGGACGCG
GCTGCAAAGT ATCTGTCTTT TTACCAGGGG CTCTTTGGGC TTTATCCTTT TGAAAAGTTT
GCAATTGTAG AAAATTTCTT TCCGACCGGT TATGGCTTCC CGTCCTTTAC GCTGCTCGGT
GGGACTGTAA TCAGGCTTCC CTTCATCGCC GACACAAGCC TTCCCCATGA GATCGTCCAC
TCTTGGTGGG GTAATGGGAT AGACGTCGAT CTGAGCCAGG GGAACTGGTG CGAGGGGCTC
GTCACCTATC TAGCCGATTA CCTGCTTAAG GAACGCCGCT CACCAGCCGA GGCGCTGGAA
TACCGCAAAC AACTCCTGAT CGACTACGCC TCGCTGGTGA CGGCTGAAAA CGACTTTCCA
CTTACCAGTT TTGTCAGCCG TAATGATCCC GCTTCGCGTG CCATAGGGTA CGGCAAGGGT
GCAATGCTGT TTCACATGAT CCGTTCCCAG ATAGGGGATG ACGCGTTCTT CAATGCCTTG
CGGGCAATGG CTCGTGATCG CATGTATGGT TCGGCTTCAT GGAACGATCT TGCTGCGGTT
TTCTCGCGCA GCGCGGGCCG TGACCTTTCT CCTTGGCTGG GGGAGTGGTT GTCTCGTCCA
GGCGGGCCAC GTCTGACCTT TTCGCAGGTG GAAAAGAAAC GTCAGGGGGA GGGATGGCTG
GTGACCGGCA CCATCTTGCA GTCCTCCCCT TCATTCGATG TACGGTTGCC GCTAACTCTA
GAGACCGAGA GCGGAGCGAT TGAGACAGTG TTGCCGGTGC CTGATCAGAA TAGTGCCCGC
TTCGCCATAT CCACGTCAGT ACCGCCGAAG CGGCTCCTGC TTGATCCTGG TGCCTCTATC
TTTCGAATTA TTTCTCATGC AGAAATCCCT GCTACTGTAA ACAGCATTAA AGGATCTACC
TCGCTTGTCG GTGTCATGAC TAAGAATTGT CAGGCACCGC CGGAGTTATT CAAGGCTATG
CTTGCTTCTC TTTCCCAAGC TGACGCACGC GTGTTGATCG AGTCGGCTCT GGATCCGGCC
CAGGCTTTAT CCGACGATCT GGTTTTTTGC GGGATGCCGC AGAACCTTTC TCTGCCCCAG
ATTCCGCAGC AGGTGAGTAC AACCTATAAG TCAACAATTG CAAGCGCAGG CGACGATAGC
CTACTTTTCG TGGTGCTCAA ACGTCATTCC CCACGAACAG GAGTTGTCGC GTTGTTTCAA
CCTGAATCTA GGGTTGCAGC TGAAAAGTAC GCTGGAAAAA TCACCCATTA CGGCAAATAC
GGTTTTCTGA TCTTTTCTGC TGGCTCAATT CGAAACAAGG GGACCGGTAC AGCAGACGTA
GAAGGGGGCT CGATAAATCT CTCGAATTGA
 
Protein sequence
MSAGGRVKIL LICSAMVLIA VAGRGVAGGA ESGPRVKVQE ISISLEPERH LVVGQSSIVF 
EHGAERVSLR LAETARVESA RAAGKEIPFS FARGVLVLEL PQGDSVTITV SYRAEFNDPV
SRNPAAAEDP SYGVSAAITS KGTFLGGGSH WYPVPSQVPL SRKISIIAPA GIEGITNGAR
ILRETSGGVT KSVWQESRPV GVPSVSAGPY LVEERQAAGV RLYSYLYRDN ANLAPRYLDA
AAKYLSFYQG LFGLYPFEKF AIVENFFPTG YGFPSFTLLG GTVIRLPFIA DTSLPHEIVH
SWWGNGIDVD LSQGNWCEGL VTYLADYLLK ERRSPAEALE YRKQLLIDYA SLVTAENDFP
LTSFVSRNDP ASRAIGYGKG AMLFHMIRSQ IGDDAFFNAL RAMARDRMYG SASWNDLAAV
FSRSAGRDLS PWLGEWLSRP GGPRLTFSQV EKKRQGEGWL VTGTILQSSP SFDVRLPLTL
ETESGAIETV LPVPDQNSAR FAISTSVPPK RLLLDPGASI FRIISHAEIP ATVNSIKGST
SLVGVMTKNC QAPPELFKAM LASLSQADAR VLIESALDPA QALSDDLVFC GMPQNLSLPQ
IPQQVSTTYK STIASAGDDS LLFVVLKRHS PRTGVVALFQ PESRVAAEKY AGKITHYGKY
GFLIFSAGSI RNKGTGTADV EGGSINLSN
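
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the table's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
# Codon table built in the NCBI layout: bases ordered T, C, A, G at each
# position, with the amino-acid string listing all 64 codons in that order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAGTGCGG GGGGGCGTGT GAAAATCCTA"
print(translate(first_block))  # MSAGGRVKIL
```

The output matches the first ten residues of the protein sequence. Translating the final twelve bases of the gene (CTCTCGAATTGA) in the same way gives LSN followed by the TGA stop codon, which agrees with the end of the protein.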