Gene Gbem_3389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_3389 
Symbol 
ID6780247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp3896155 
End bp3897609 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content60% 
IMG OID642769380 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_002140180 
Protein GI197119753 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00176848 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATG CAATAAGTGG GTTGGAACCT GAATCGTTTT GGCGCTGTTT TGCAGAAATT 
GCCGGCATTC CGAGACCGTC GGGTCATGAG GCAAGAATAG GCGCTTTCAT CCTGGACCGG
GCGAAGCAAC TGGGCCTGCA AGGGGCGCAG GACGCCTGCG GGAACATCGT GGTCAGGAAA
CCAGCGTCAC CGGGTAAAGA GCGCGTAGCC GGCATCTGCC TGCAGTCCCA CCTCGATATG
GTGTGCGAGA AGAATGCGGA CAAGGTGCAC GATTTCCTCA ACGACCCCAT CGAATTGGTG
CGCAGGGATC AGGTGGTGAC CGCAAACGGC ACCACCTTGG GGGCGGATAA CGGAGTCGGT
GTCGCCGCTT CCCTCGCGTT GATGGAATAC CGGTCCCTTT CACACGGGCC GCTGGAATTC
CTGTTCACGG TAGAGGAGGA GACTGGGCTG ACCGGCGCCA AGAACCTGAG CCCAAGCCTG
GTGCAAAGCA GAACCCTCCT CAACCTGGAC TCGGAGGAAG AAGGGGCGCT CTACATCGGG
TGTGCGGGCG GCAAGGATAC GGTGGGATGC TGGAACTACG CAACTGAAGC GGCACCGGCG
GACGCCGTTG CGCTCGTTGT AGCGGTCAAG GGGCTCAAGG GCGGGCATTC TGGCCTGGAG
ATAGACAAGG GTTTGGGAAA CGCCATCAAG CTGTTGAACC GCGCGCTCTG CAGGCTGTCC
GGAATCGGGG CTAGGGTTGC AGGTATCGAC GGGGGAAACA TGCGGAACGC TATCCCCCGT
GAGGCGACTG CGCAGTTCTA TCTGCCGGCA GCGAAGCTGA CTGAGGCCGA GGCGCTGGTG
CCGGAACTGG ACCTGGTATT CAGGGCGGAA TTGGGGAATG TCGACTCCGG CGTCGTGCTG
GCTATGAGCC GGGATGATGC GGGGAGTGGC AAGGTGATGG ATGCGACGGT TCAGGAGAAG
CTTCTTAAGG CCATCTCCGC GCTTCCCAGC GGCGTCCAGC GCATGAGCCA CGACATTACC
GGACTGGTCG AGACCTCCAC CAACGTTTCT GTCATCAGCA CCAGCGAGAG TGGCGTCACC
CTGGTCACCA GCCAGCGCAG TTCCTCCGCT TCGCGCCTCG GGGAAGTGGT CGAGGGCGTC
GAGTCGATAT TCCAACTGGG TGGTGCGGTG GTGGAAGTGA GCGAGGGGTA TCCAGGGTGG
CAGCCCAACG TCGATTCGGC CATCCTGAAG CTGGCGCTGC AGTGCTACCG TGCGCTTTAT
GACCGCGATG CGGAAGTGAA GGCAATTCAC GCCGGACTCG AATGCGGCAT CATCGGGGAG
CGCATTCCCG GTATGGACAT GATTTCGCTG GGGCCCAACA TGGAAAAGGT GCACTCCCCG
GAAGAGAAGG TGTACATAGA CAGCGTCGCA AATTTCTGGA ACTTCCTGCT GGAGATTTTA
AAGACTGCAC AGTGA
 
Protein sequence
MSDAISGLEP ESFWRCFAEI AGIPRPSGHE ARIGAFILDR AKQLGLQGAQ DACGNIVVRK 
PASPGKERVA GICLQSHLDM VCEKNADKVH DFLNDPIELV RRDQVVTANG TTLGADNGVG
VAASLALMEY RSLSHGPLEF LFTVEEETGL TGAKNLSPSL VQSRTLLNLD SEEEGALYIG
CAGGKDTVGC WNYATEAAPA DAVALVVAVK GLKGGHSGLE IDKGLGNAIK LLNRALCRLS
GIGARVAGID GGNMRNAIPR EATAQFYLPA AKLTEAEALV PELDLVFRAE LGNVDSGVVL
AMSRDDAGSG KVMDATVQEK LLKAISALPS GVQRMSHDIT GLVETSTNVS VISTSESGVT
LVTSQRSSSA SRLGEVVEGV ESIFQLGGAV VEVSEGYPGW QPNVDSAILK LALQCYRALY
DRDAEVKAIH AGLECGIIGE RIPGMDMISL GPNMEKVHSP EEKVYIDSVA NFWNFLLEIL
KTAQ