Gene SO_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSO_1115 
SymbolpepD 
ID1168949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella oneidensis MR-1 
KingdomBacteria 
Replicon accessionNC_004347 
Strand
Start bp1158327 
End bp1159787 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content48% 
IMG OID637343066 
Productaminoacyl-histidine dipeptidase 
Protein accessionNP_716740 
Protein GI24372698 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGCGT TAAGTCAGTT ATACCCACAA CCTTTATGGC AATGGTTCGA ACAAATCTGT 
GCGATTCCCC ACCCTTCCAA ACATGAACAA TCACTCAGCC AACACATCCA AGCTTGGGCT
AAGCAGAAAC AGCTCGATGT GGTTGAAGAT GCTGTCGGTA ATGTCATTAT CCGCAAAGCT
GCGACTCCAG GAATGGAAGA TCGCAAAGTT GTGGTTATTC AAGCCCATAT CGATATGGTG
CCACAAAAAA ATGCCGATAA AGTGCATGAC TTTATTAAAG ATCCCATCGA AGCCTATGTC
GATGGCGACT GGGTTAAAGC TAAAGGCACC ACCTTAGGCG CCGACAACGG CATTGGTATG
GCCTCAGCCT TAGCGATTTT AGGCTCGGAC GATATCAAGC ATGGTCCACT CGAAGTACTG
CTCACTATTG ATGAAGAAGC CGGTATGACA GGCGCATTTG GCCTGCAGGC AGGTATGCTC
AATGCCGAGA TCCTGATCAA CACCGACTCA GAGCAAGAAG GTGAAATCTA TATGGGCTGC
GCTGGCGGCG TCGATGCCCA AATCACCCTG CCTATGGTAT GGCAAGCTTC AGAACAAAGT
TACGCTTCAT TTAGCCTGCA TTTATCAGGC TTAAAAGGTG GCCACTCTGG GGTCAACATT
CATTTAGGCC GTGGTAACGC CAACAAGATT TTGGCGCGCT TCCTGTTTGA AAATGCCGAT
GAGTTAGCGC TGGAATTAAC TCAATTTACC GGTGGTTCAC TGCGCAACGC CATCCCACGT
GAAGCCAATA TCAGCTTTAT GTTACCCGCT GAAAATATCG ATGCGCTCAA GGAAAAAGTG
CATGCCTTTG AAGCGTTAAT GCGTGCCGAA CTCGCTATTG CCGACCCAGA TTTACGTTTA
GTGTTAAGCA ACATTGCCAC GCCCAAACGC GTGATGAGCG AAAACAGCCA AAACACGCTT
ATCGATTTAC TGCATGTTTG CCCTAACGGC GTGATGCGCA TGAGCGATGA AGTGACTGGC
GTGACCGAAA CCTCACTCAA CGTTGGCGTG ATCAGCACTA ACGATGAAGA AGTTGGCATT
CTGTGCTTGA TTCGCTCATT GATCGATTCT GGCCGCAGCC AAGTTGAAGG TATGCTAAAT
GCGTTAACCA ACTTAGCAGG TGCTGACGTT GAATTTAGCG GTGCTTACCC TGGTTGGAAG
CCGGATAACA CTTCGCCTGT AATGGCGATT GTGCGTGAAA CCTACGAGTC TATCTACCAT
AAAGAGCCAG TGATCATGGT GATCCATGCA GGCCTTGAAT GTGGACTGTT TAAAAAGCCT
TATCCCGAAA TGGATATGGT CTCCATCGGC CCAACCATCC GCTATCCACA TGGTCCTGAT
GAAATGGTCA ATATCACCAC CGTTGGTCAG TATTGGGATC TACTCGTAGC CGTGCTAGAA
CGCATTCCCG TTAAAGCCTA A
 
Protein sequence
MTALSQLYPQ PLWQWFEQIC AIPHPSKHEQ SLSQHIQAWA KQKQLDVVED AVGNVIIRKA 
ATPGMEDRKV VVIQAHIDMV PQKNADKVHD FIKDPIEAYV DGDWVKAKGT TLGADNGIGM
ASALAILGSD DIKHGPLEVL LTIDEEAGMT GAFGLQAGML NAEILINTDS EQEGEIYMGC
AGGVDAQITL PMVWQASEQS YASFSLHLSG LKGGHSGVNI HLGRGNANKI LARFLFENAD
ELALELTQFT GGSLRNAIPR EANISFMLPA ENIDALKEKV HAFEALMRAE LAIADPDLRL
VLSNIATPKR VMSENSQNTL IDLLHVCPNG VMRMSDEVTG VTETSLNVGV ISTNDEEVGI
LCLIRSLIDS GRSQVEGMLN ALTNLAGADV EFSGAYPGWK PDNTSPVMAI VRETYESIYH
KEPVIMVIHA GLECGLFKKP YPEMDMVSIG PTIRYPHGPD EMVNITTVGQ YWDLLVAVLE
RIPVKA