Gene GM21_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4109 
Symbol 
ID8139483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4692479 
End bp4693570 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content70% 
IMG OID644871724 
Productpeptidase M24 
Protein accessionYP_003023882 
Protein GI253702693 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones102 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCA CCGCGCGCGC GGAGGAGATG GCCGGGAAGA TCCGCCTGGT CCGGGAATTG 
CTCGGCGAGG GAAGGGTGCT GAGGCTCAAG GGGATCGACT GGTTCTCCTG GATCACGGCC
GGGGGCTCCA ACGAGGTGCT CTTGGCGGCC GAGACCGGGA TCGCCGAGTT CGTGGTGACC
GGGCGCGGCG CTTTCGTGGT GACCAACGAG ATCGAGGCGC AGCGCCTGAT CGACGAAGAG
CTCCCCCCCG GCTGCGAGCT GCGCATCCTC CCCTGGGCCT ACCCTTCCCA GTTGGAGGTG
GTGATGAGGG AGCTGGCCGA AGGGACGCCG GTCTACTCCG ACCGCCCGGC CGAGGCGGAG
CGGGAGCTCC CGCTGCCGCT TCTGGCGGCC AAGCGCACCC TCTGTCCTGC CGAGCTGCAC
CGCTACCGCG AGGTGGGGCT TCTGGCCTCG CAGGCGATGA CCGAGACCCT GCAGCAGGCG
AACCCCGACT GGAGCGAGTA CCGCCTTGCC GCGGCAGGCG CCTGCGCCCT CCTCTCGCGC
GGGCTCGCCC CCTGCCTGAT CATGGCTGCC GGGGACAGGC GCCGCCGGCT GTACCGCCAT
CCGATCACCA ACAGGGACCC GCTGGGCGCC TCCGCGATGC TGGTCTTCTG CGCCCGGGGG
TACGGCCTCT ACGCCAACCT CACCAGGTTC GTCGCCTTCG GGCCCCTATC CGACGAAGAG
GAGCAAAAGC ACGCGCAGGT GCGCGAGATC GAGGCCCACG CGCTGCTCCT CTCCCGCCCG
GGGGTCCTTT TGCACGAGGT CTACCGCGAG CTTGCCTCGG CCTACGCCGC CGCGGGTTAC
GAGCACGCGG TCAAGGAGCA CCACCAGGGC GGGATCACCG GCTACCTTTC CCGGGAGGCG
ATAGCGAATC CCGAGGCGCG GGAGCACCTG AGCGCCGGGA TGGCCGTCGC CTGGAACCCC
AGCCTCCCCG GCGCGAAAAT AGAGGATACC TTTCTGGTGA CAGAGACCGG AGTCGAAAAC
CTGACGCTCG ACCCGGCCTG GCCGACGGTG CAGGCGGCCG GTCTGGAACG GCCGCTCGTC
CTCAGACGAT AG
 
Protein sequence
MNGTARAEEM AGKIRLVREL LGEGRVLRLK GIDWFSWITA GGSNEVLLAA ETGIAEFVVT 
GRGAFVVTNE IEAQRLIDEE LPPGCELRIL PWAYPSQLEV VMRELAEGTP VYSDRPAEAE
RELPLPLLAA KRTLCPAELH RYREVGLLAS QAMTETLQQA NPDWSEYRLA AAGACALLSR
GLAPCLIMAA GDRRRRLYRH PITNRDPLGA SAMLVFCARG YGLYANLTRF VAFGPLSDEE
EQKHAQVREI EAHALLLSRP GVLLHEVYRE LASAYAAAGY EHAVKEHHQG GITGYLSREA
IANPEAREHL SAGMAVAWNP SLPGAKIEDT FLVTETGVEN LTLDPAWPTV QAAGLERPLV
LRR