Gene GM21_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0857 
Symbol 
ID8136178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1021353 
End bp1022807 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content61% 
IMG OID644868473 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_003020682 
Protein GI253699493 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATG CAATCCGTGG GTTGGAACCT GAATCCTTTT GGCGTTGTTT CGCAGAAATA 
GCAGGCATCC CGAGACCGTC AGGTCATGAG GCGAGAATAG GTGCTTTCAT ACTTGATCGG
GCCAAGCAAC TGGGCCTCCA AGGCCTGCAG GATGATTGCG GAAATATCGT GGTCAGGAAG
CCAGCATCGC CGGGTAAAGA GACGATCGCG GACATTTGCC TGCAGTCCCA CCTCGACATG
GTGGGCGAGA AGAATGCGGA CAAGGTACAC GATTTCCTGA ATGATCCTAT CGAACTGGTC
CGCAGGGATG AGTTCGTTAC CGCAAACGGC ACCACACTGG GCGCGGATAA CGGAGTCGGT
GTCGCTGCTT CCCTCGCGCT CATGGAAGAC CGGTCCCTTC CGCACGGGCC GCTGGAATTC
CTGTTCACTG TGGAGGAGGA GACGGGGCTG ACTGGCGCCA AGAATCTGAG CCCGGACCTG
GTGCGTAGCA GGACTCTCCT CAACCTGGAC TCGGAGGAAG AAGGGGCGCT CTACATCGGG
TGCGCGGGCG GCAAGGACAC GGTGGGAAGC TGGAGCTACG CCGCCGAAGC GGCGCCGGCG
GACGTGGTTG CGCTCGTCGT AGCGGTCAAG GGGCTCAAGG GCGGCCATTC GGGCCTGGAG
ATAGACAAGG GGTTGGGAAA CGCCATCAAG CTGTTGAACC GCGCGCTCCG CAGGTTGTCC
AAAATCGGAG GGAGGGTTGC AGGTATCAAC GGGGGAAACA TGCGCAACGC CATCCCCCGT
GAGGCAACTG CGCAGCTGTA TCTACCGGCG GCGAGGCTGG CCGAGGCCGA GGCGCTGGTG
TCGGAACTGG ATCTGGTGTT CAGAGCGGAA CTCGGGACGA TCGATCCCGG CGTCGTGCTG
ACCATGAGCC GGGATGATGC GGCGTCTGGC AAGGTCATGG ATGCGACGGT TCAAAAGAGC
CTCCTGAAAG CCATCTCCGC GCTTCCCAGC GGCGTCCAGC GCATGAGCCA CGACATTGCC
GGACTGGTCG AGACCTCCAC CAACGTTTCC GTCATCAGCA CCAGCGATTG CGGCATCACA
TTGGTCACCA GCCAGCGCAG TTCATCCGCT TCGCGCCTCG GGGAAGTGGT CGAGAGCGTC
GAGTCGATCT TCGAGCTGGG TGGCGCGGTG GTCGAAGTGA GCGAGGGGTA TCCGGGGTGG
CAGCCGAACG TCGATTCGGC GGTCCTGAAG CTGGCACTAC AGTGCTACCG TGCGCTCTAT
GACCGCGATG CGGAAGTGAA GGCGATTCAC GCCGGGCTCG AATGCGGCAT CATCGGCGAG
CGCATCCCCG GCATGGACAT GATTTCGTTG GGGCCCAACA TGGAAAAGGT GCATTCCCCC
GAAGAGAAGG TGTACATCGA CAGCGTCGCG AACTTCTGGA CCTTCCTGCT GGAAATTTTA
AAGAGTGCAC AGTGA
 
Protein sequence
MSDAIRGLEP ESFWRCFAEI AGIPRPSGHE ARIGAFILDR AKQLGLQGLQ DDCGNIVVRK 
PASPGKETIA DICLQSHLDM VGEKNADKVH DFLNDPIELV RRDEFVTANG TTLGADNGVG
VAASLALMED RSLPHGPLEF LFTVEEETGL TGAKNLSPDL VRSRTLLNLD SEEEGALYIG
CAGGKDTVGS WSYAAEAAPA DVVALVVAVK GLKGGHSGLE IDKGLGNAIK LLNRALRRLS
KIGGRVAGIN GGNMRNAIPR EATAQLYLPA ARLAEAEALV SELDLVFRAE LGTIDPGVVL
TMSRDDAASG KVMDATVQKS LLKAISALPS GVQRMSHDIA GLVETSTNVS VISTSDCGIT
LVTSQRSSSA SRLGEVVESV ESIFELGGAV VEVSEGYPGW QPNVDSAVLK LALQCYRALY
DRDAEVKAIH AGLECGIIGE RIPGMDMISL GPNMEKVHSP EEKVYIDSVA NFWTFLLEIL
KSAQ