Gene GM21_1377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1377 
Symbol 
ID8136705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1624803 
End bp1625969 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content61% 
IMG OID644868991 
Productpeptidase M24 
Protein accessionYP_003021194 
Protein GI253700005 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value0.209377 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACA GGCAGGAATC GCAACTAAGA ATCGCCAGGC TGCAACAGGA ACTTAAGGCA 
AAGGGTATCG ACGGGGCGCT CTTCATCTAC CCCATCGACG TCTACTATTT CACCGGCACC
CGCCAGAATT CGACGCTCTG GGTCCCCGCC GAGGGCAAAC CGCGCCTGAT GGTGCGCAAA
AGCGTCTCCA GGGCGGTCAA GGAAAGCTTA ATCGAGGAGA CCGTCCCCTT TCCGTCCAGC
AAGGAGTTCC CGGCGCTGTT CCCGCCCGAG ATGCAGAAGA TAGGGTTCAC CTTCGACGTG
GCGCCGGTGC AGCAGTACAA CTACTACGCG AAGCTTTTGC CGGGACGCGA GTTTGTCGAC
GTCTCCGCCA TCAACCGCGA AATCCGCTCG GTGAAGTCGG AGTGGGAACT GGGGCAGATG
CGGCAAAGCG GCGACATGAT CTGCCAGGTC TTCAGGGAGG TTCCGGGATT CCTGAAGGAA
GGGATGCGCG AGGTGGACCT GGCGGCAGAG TTCGAATGCC GGCTGAGAAA GGCCGGGAGC
GAAGGTTACG TGCGCATGCG CGCCTTCAAT CAGGAGCTGT TCCAAGGGCT CGCGGTTTCA
TCCGCGGCCT GCGACCCCGG CTTCTTCGAC GGCGCCGTGA CCGGGCAGGG GATGTCCAGT
GCCTCCCCGC ATGGCGCATC CGCCGCGGTA ATCAAAGCCA ATACCCCTAT CCTCGTCGAC
TATACCGGCA TCTTCAACGG CTACATCGTT GACATAACTC GCTTTTTCGT CATCGGCAAG
CTGGCGCCCG AGTTGGAGCA CGGCTTCGCC ACGGCGCTCG CCATCCAGAA ATACCTGGTC
GACAACCTGA AGCCGGGGGT GGTCTGCGAG GAGTTGTTCC TGAAGGCGGC CGAGATGGCG
GAAGCCGCGG GCTTGGCCCG GAACTTCATG GGGGCCCCCG GAGAGAACGC CAAGTTCGTG
GGGCACGGGG TCGGGTTAGA GCTGGACGAG TTCCCGGTAC TGGCGCAAGG GTTCAAGGTG
CCGCTGCAGG AAGGGCAGAC CATTGCCATC GAACCGAAAT TCGTCTTCCC GGGCCAGGGT
GTGGTCGGGA TAGAGAACAC CTTTGCTGTC GGCAAAAACG GCGGCGTGAA ACTGACCGAC
ATGCCGGACG AGGTCGTGTA CCTGTAA
 
Protein sequence
MLNRQESQLR IARLQQELKA KGIDGALFIY PIDVYYFTGT RQNSTLWVPA EGKPRLMVRK 
SVSRAVKESL IEETVPFPSS KEFPALFPPE MQKIGFTFDV APVQQYNYYA KLLPGREFVD
VSAINREIRS VKSEWELGQM RQSGDMICQV FREVPGFLKE GMREVDLAAE FECRLRKAGS
EGYVRMRAFN QELFQGLAVS SAACDPGFFD GAVTGQGMSS ASPHGASAAV IKANTPILVD
YTGIFNGYIV DITRFFVIGK LAPELEHGFA TALAIQKYLV DNLKPGVVCE ELFLKAAEMA
EAAGLARNFM GAPGENAKFV GHGVGLELDE FPVLAQGFKV PLQEGQTIAI EPKFVFPGQG
VVGIENTFAV GKNGGVKLTD MPDEVVYL