Gene GM21_2562 details


Gene Information

Locus tag: GM21_2562
Symbol:
ID: 8137904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2990867
End bp: 2992057
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 63%
IMG OID: 644870171
Product: peptidase M24
Protein accession: YP_003022361
Protein GI: 253701172
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
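
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for GM21_2562.
length = gene_length_bp(2990867, 2992057)
print(length, protein_length_aa(length))  # prints "1191 396"
```

Both results match the Gene Length and Protein Length fields of the record.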


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 2.65621e-21
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGCTTA CGCCAAAAAA CGAGCTGGAA TACCGCTACA GAAAACTGCA AAGCGAGATG 
GCCGGTGCGG GAATCGACGC GGTCATCATG GTCCAGAACG CGGACCTGTT TTATTTCACC
GGGACCGCGC AGGCGGGAAA TCTTTACGTT CCCGCCTCCG GGCGCCCCAT CTACTTGGTG
CGCAGGGACT ACCTCAGGGC CCGCATGGAA AGTGCTTTGG CCGAGGTGCT CCCTTCGTCC
TCCCTCAACG ATCTCCCAGG CCTTGTTTCC GACCACGGCC ATGGCGCGCC GAAAAGGCTC
GGCCTCGAAC TCGACGTCCT ACCGGTCAAC CTCTACCTCA AATACCGCGC CCTCTACCCC
GACGCCGAAC TGGTGGACGC CTCCCCCCTG ATACGCCGGG TGCGCATGTT GAAGTCCCAT
TACGAAATCC ACATCATGCA GGACGCCGGC AACCAGGCGG ACAAGGTGTA CAAGAAGGCC
GCCGAGACCA TCCGCGAGGG GATCTGCGAG GTGGAGCTCG CCGCCGAACT GGAGCGCGTC
TCCCGCCTTG AAGGGCACCA GGGGTACGTG CGCATGCGGG CCTTCAACGG CGAGGTCGGC
GGAGCCCAGG TGCTCGCCGG GCCGGACGCC GCGGCACCCG CCGCCGGCAA CACCCCGCTG
GGCGGCATGG GGCTCACCCC CGCCTTCGGC CACGGTGCCG GCTACAACAG GATCCGGCGC
GGCGAGCCGG TCGTGGTCGA TTTCGCCAGT TGCTTCGACG GCTACCTGGT CGACCAGACC
AGGATCTTCG CCATAGGCTC CGTCTCGGAC CGGATGCGGC GCGGCTACCA AGACATGCTG
CGCATAGAGG AACTGATGAT GGAGATGGCC GAGGTGGGAA CCAGTTGGGG GGGGATCTAC
CATGCCTGCC TCGATCTGGC GTGCGAGATG GGGTACGCGG ACAACTTCAT GGGGACCCCG
GGCAGCCAGG TTCCCTTCAT CGGCCACGGG CTGGGCATCG AGATTGACGA ATACCCCTTC
ATCGCCCGCG GCTTCGAAAC CGAGACGCTC CAGGTGGGGA TGGCATTTGC CTTCGAGCCG
AAACTCGTGT TTCCCGGGGA AGGGGCGGTC GGCATCGAAA ACTCGTTTTA TCTCTCGGAA
CAGGGACTTA AGAGACTGAC TTTTTCCAGG CAGGAGCTGG TACTGCTCTG A
 
Protein sequence
MRLTPKNELE YRYRKLQSEM AGAGIDAVIM VQNADLFYFT GTAQAGNLYV PASGRPIYLV 
RRDYLRARME SALAEVLPSS SLNDLPGLVS DHGHGAPKRL GLELDVLPVN LYLKYRALYP
DAELVDASPL IRRVRMLKSH YEIHIMQDAG NQADKVYKKA AETIREGICE VELAAELERV
SRLEGHQGYV RMRAFNGEVG GAQVLAGPDA AAPAAGNTPL GGMGLTPAFG HGAGYNRIRR
GEPVVVDFAS CFDGYLVDQT RIFAIGSVSD RMRRGYQDML RIEELMMEMA EVGTSWGGIY
HACLDLACEM GYADNFMGTP GSQVPFIGHG LGIEIDEYPF IARGFETETL QVGMAFAFEP
KLVFPGEGAV GIENSFYLSE QGLKRLTFSR QELVLL