Gene Information |
Locus tag | GM21_0857 |
Symbol | |
ID | 8136178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1021353 |
End bp | 1022807 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868473 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_003020682 |
Protein GI | 253699493 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
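The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1021353, 1022807)
print(length)                     # 1455
print(protein_length_aa(length))  # 484
```

Both values agree with the Gene Length and Protein Length fields reported in the record.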
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATG CAATCCGTGG GTTGGAACCT GAATCCTTTT GGCGTTGTTT CGCAGAAATA GCAGGCATCC CGAGACCGTC AGGTCATGAG GCGAGAATAG GTGCTTTCAT ACTTGATCGG GCCAAGCAAC TGGGCCTCCA AGGCCTGCAG GATGATTGCG GAAATATCGT GGTCAGGAAG CCAGCATCGC CGGGTAAAGA GACGATCGCG GACATTTGCC TGCAGTCCCA CCTCGACATG GTGGGCGAGA AGAATGCGGA CAAGGTACAC GATTTCCTGA ATGATCCTAT CGAACTGGTC CGCAGGGATG AGTTCGTTAC CGCAAACGGC ACCACACTGG GCGCGGATAA CGGAGTCGGT GTCGCTGCTT CCCTCGCGCT CATGGAAGAC CGGTCCCTTC CGCACGGGCC GCTGGAATTC CTGTTCACTG TGGAGGAGGA GACGGGGCTG ACTGGCGCCA AGAATCTGAG CCCGGACCTG GTGCGTAGCA GGACTCTCCT CAACCTGGAC TCGGAGGAAG AAGGGGCGCT CTACATCGGG TGCGCGGGCG GCAAGGACAC GGTGGGAAGC TGGAGCTACG CCGCCGAAGC GGCGCCGGCG GACGTGGTTG CGCTCGTCGT AGCGGTCAAG GGGCTCAAGG GCGGCCATTC GGGCCTGGAG ATAGACAAGG GGTTGGGAAA CGCCATCAAG CTGTTGAACC GCGCGCTCCG CAGGTTGTCC AAAATCGGAG GGAGGGTTGC AGGTATCAAC GGGGGAAACA TGCGCAACGC CATCCCCCGT GAGGCAACTG CGCAGCTGTA TCTACCGGCG GCGAGGCTGG CCGAGGCCGA GGCGCTGGTG TCGGAACTGG ATCTGGTGTT CAGAGCGGAA CTCGGGACGA TCGATCCCGG CGTCGTGCTG ACCATGAGCC GGGATGATGC GGCGTCTGGC AAGGTCATGG ATGCGACGGT TCAAAAGAGC CTCCTGAAAG CCATCTCCGC GCTTCCCAGC GGCGTCCAGC GCATGAGCCA CGACATTGCC GGACTGGTCG AGACCTCCAC CAACGTTTCC GTCATCAGCA CCAGCGATTG CGGCATCACA TTGGTCACCA GCCAGCGCAG TTCATCCGCT TCGCGCCTCG GGGAAGTGGT CGAGAGCGTC GAGTCGATCT TCGAGCTGGG TGGCGCGGTG GTCGAAGTGA GCGAGGGGTA TCCGGGGTGG CAGCCGAACG TCGATTCGGC GGTCCTGAAG CTGGCACTAC AGTGCTACCG TGCGCTCTAT GACCGCGATG CGGAAGTGAA GGCGATTCAC GCCGGGCTCG AATGCGGCAT CATCGGCGAG CGCATCCCCG GCATGGACAT GATTTCGTTG GGGCCCAACA TGGAAAAGGT GCATTCCCCC GAAGAGAAGG TGTACATCGA CAGCGTCGCG AACTTCTGGA CCTTCCTGCT GGAAATTTTA AAGAGTGCAC AGTGA
|
Protein sequence | MSDAIRGLEP ESFWRCFAEI AGIPRPSGHE ARIGAFILDR AKQLGLQGLQ DDCGNIVVRK PASPGKETIA DICLQSHLDM VGEKNADKVH DFLNDPIELV RRDEFVTANG TTLGADNGVG VAASLALMED RSLPHGPLEF LFTVEEETGL TGAKNLSPDL VRSRTLLNLD SEEEGALYIG CAGGKDTVGS WSYAAEAAPA DVVALVVAVK GLKGGHSGLE IDKGLGNAIK LLNRALRRLS KIGGRVAGIN GGNMRNAIPR EATAQLYLPA ARLAEAEALV SELDLVFRAE LGTIDPGVVL TMSRDDAASG KVMDATVQKS LLKAISALPS GVQRMSHDIA GLVETSTNVS VISTSDCGIT LVTSQRSSSA SRLGEVVESV ESIFELGGAV VEVSEGYPGW QPNVDSAVLK LALQCYRALY DRDAEVKAIH AGLECGIIGE RIPGMDMISL GPNMEKVHSP EEKVYIDSVA NFWTFLLEIL KSAQ
|
| |
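The gene and protein sequences are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips the spacing and translates DNA with the standard codon assignments, which translation table 11 shares for internal codons. It is an assumption-laden example, not the database's own method: the helper names are hypothetical, and alternative start codons that table 11 permits are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and normalise case."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGTCTGATG CAATCCGTGG GTTGGAACCT"))  # MSDAIRGLEP
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` should allow the same comparison against the complete protein sequence.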