Gene Ssed_3421 details


Gene Information

Locus tag: Ssed_3421
Symbol:
ID: 5610720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 4171733
End bp: 4173193
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 47%
IMG OID: 640934361
Product: Xaa-His dipeptidase
Protein accession: YP_001475153
Protein GI: 157376553
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
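
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4171733, 4173193)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```

Both values agree with the Gene Length and Protein Length fields in the record.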


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.11643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCAT TAAGTCAATT ACAACCTCAA GCCCTGTGGC AGTGGTTCGA ACAAATTTGT 
GCAATTCCCC ATCCATCTAA ACATGAGCAG GCTTTGAGCG AACATATTCA AGCCTGGGCC
AAAGATAAAC AACTCGAACT GGTTGAAGAT AAAGTCGGTA ACCTCATCAT TAAGAAGCCT
GCTACACCGG GTATGGAAAA CCGTAAGGTT GTTGCCCTGC AGGCTCATAT CGATATGGTG
CCGCAGAAGA ACTCTGATAA AACGCATGAT TTCGAGAAAG ACGCGATTGA ACCTTTTATC
GATGGTGAAT GGGTAAAAGC GACCGGAACA ACCTTAGGTG CCGATAATGG TATCGGCATG
GCGTCTGCAT TAGCCATTTT AGGTTCAGAC GATATTCCTC ACGGCCCACT GGAAGTCCTG
CTGACAATAG ATGAAGAAGC GGGCATGACG GGTGCATTTG GACTGGAAGC CGGTTACCTG
AATGCAGATA TTCTGATCAA CACAGACTCT GAGCAGGAAG GCGAGATCTA CATGGGTTGC
GCCGGTGGTG TTGACGGACA GATTAGCGTG CCTATGGTTT GGCAAGCTCC TGAGCAAAGT
CACTCAACCT ACACCTTAAC CCTTTCAGGC TTGAAAGGTG GCCATTCAGG GGTAAACATT
CACTTGGGTC GCGGTAACGC GAATAAACTA TTAGCTCGCT TCCTGTTTAA CCATGCAGAC
GAATTAGCCT TAGAGTTAAC GAACTTCACT GGTGGCTCTC TGCGAAATGC TATCCCACGT
GAAGCCTCGG TAAGCTTTAT GCTGCCAGCT GAAAACATCA CAAAACTTGA TGCCCTGGCG
AAAGAGTTTC AGGCGTTAGT AAGAGAAGAG CTTGCTATTG CCGATCCGGA CATGGTGCTG
GAGCTACTCG AAGCCCCGGC TGCTAAACAA GTGATGAGCG AAGATGCTCA GAATATGCTT
ATCGACCTAC TTAATGCGTG TCCAAATGGC GTTATCCGCA TGAGTGATGA GGTTGAAGGC
GTTACTGAAA CATCATTAAA TGTAGGTGTG ATTAGCACAG AAGCAGAAAG TGTTGAAGTT
CTTTGCTTAA TTCGCTCTCT TATCGATTCT GGTCGCCAGG AAATTGAAAC TGTTTTAACT
TCACTCACCA ACCTAGCCGG TGCTGAAATC CAGTTTAGCG GCGCATATCC AGGCTGGAAG
CCAGACAACA GCTCACCGGT AATGGCATTG GTACGCGAGA CCTACGACAG CATCTACAAC
AAAGAGCCTG TGATCATGGT GATTCATGCC GGACTCGAGT GTGGTCTGTT TAAGAAACCC
TACCCTGAGA TGGATATGGT ATCGATTGGC CCAACCATTC GTTACCCACA CAGTCCGGAT
GAAAAAGTCT TGATTGAAAC CGTTGATCAA TACTACAAGC TACTGTTAGC CGTACTGGAA
CGTATTCCAG AGAAAGGTTA A
 
Protein sequence
MTALSQLQPQ ALWQWFEQIC AIPHPSKHEQ ALSEHIQAWA KDKQLELVED KVGNLIIKKP 
ATPGMENRKV VALQAHIDMV PQKNSDKTHD FEKDAIEPFI DGEWVKATGT TLGADNGIGM
ASALAILGSD DIPHGPLEVL LTIDEEAGMT GAFGLEAGYL NADILINTDS EQEGEIYMGC
AGGVDGQISV PMVWQAPEQS HSTYTLTLSG LKGGHSGVNI HLGRGNANKL LARFLFNHAD
ELALELTNFT GGSLRNAIPR EASVSFMLPA ENITKLDALA KEFQALVREE LAIADPDMVL
ELLEAPAAKQ VMSEDAQNML IDLLNACPNG VIRMSDEVEG VTETSLNVGV ISTEAESVEV
LCLIRSLIDS GRQEIETVLT SLTNLAGAEI QFSGAYPGWK PDNSSPVMAL VRETYDSIYN
KEPVIMVIHA GLECGLFKKP YPEMDMVSIG PTIRYPHSPD EKVLIETVDQ YYKLLLAVLE
RIPEKG
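
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes that translation table 11 uses the standard codon assignments, and that an alternative start codon such as GTG in the first position is read as methionine; the `START_CODONS` set here is a simplified subset chosen for illustration. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified subset of start codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # An initiating codon is read as Met even if it normally encodes Val or Leu.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record.
first_line = "GTGACAGCAT TAAGTCAATT ACAACCTCAA GCCCTGTGGC AGTGGTTCGA ACAAATTTGT"
print(translate(first_line))  # MTALSQLQPQALWQWFEQIC
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.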