Gene Ssed_4204 details

Gene Information

Locus tag: Ssed_4204
Symbol:
ID: 5611881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 5161997
End bp: 5164063
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 51%
IMG OID: 640935163
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001475936
Protein GI: 157377336
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
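
The coordinate and length fields above are related by simple arithmetic, and a short check can make that explicit. The sketch below is illustrative only: the function names are hypothetical, and it assumes that start and end are 1-based inclusive coordinates and that the gene length includes the terminal stop codon, both of which are consistent with the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5161997, 5164063)
print(length, protein_length_aa(length))  # prints "2067 688"
```

Both results agree with the Gene Length and Protein Length fields of the record.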


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAGGG CAAATAAACT CACCGTGTTG GCCATGATTT CGGCCGGACT TTTTTTAGGT 
GGCTCGGCAA ACGCGTTAGC CTCGCAGCGA ACTCATCAAA TCACCACCGA TGATTTTTTC
GATATTGGGG GGATGAGTAG CGTACAGTTA AGCCCCGACG GAAAACAGGC CGTCTGGCTC
GAGACGCGTT GGGATAAAGA GCTGGATAAA TCCCAGCGAG ACCTATGGCA GGTTAACACC
AAAAACAAAA AAACCACTCG CCTAACCTTC ACCAACGAGA GTGAGTCCAG CCCACAGTGG
AGCGGCGATG GTCAGTATAT CTACTATATT GGCAAGGTGA CCCATGAAGA CAAGAAAGCC
CCTTACAATG GTAAGAGCCA GGTCTTTCGT ATCAACCGAG ATGGCGGCGA TGCCGTGCCC
GTGACCAAAG AGGTTGAGGG GGTCAGTTCA TTTTCACTCA GTCCCGATGG TAATACCCTC
TATTTCTTAG GCAGCAAGAC GGTCAAAGAT AAAGATGAGT GGGCGTCTAT GCGTGCAGGC
CACAGCGCGC CTAAATACGC CCATGGCGAA AGAAAAACCA ACCCACTGTA TAAGCTAGAT
CTCCAGCATT TCAAACAGCA ACTCATTCTG GATGATGACA AAGTCGTATG GAATTTCAGT
GTCAGCGATG ATGGCAGTAA AGTTGCCCGA ATCACGACAT CAGATAACGA GCTGGTCAAC
CTCGAGGGTT GGTCTGATAT CGAAATTTAT AATACCCGGA GCAAGAGCAA CGAGGTTCTG
GAAGATACGG CATGGCGCGA ACAGGCTCCA TCGCCCTATG GCTGGCTACT CGGACTCGAT
TGGCACTCAG ATAACCAACA GCTCGCCTTT CGTATCGACT TCGATGGCCA TCCGGGCAAG
CTGTTCGTCT CCAATACTAA GACCAAGTCA CTGACCGAAG TGACGCGTAT CGATGATGTG
ACGCTAAACT CGTCAGATAT TCACTGGCGC CCAAACAGTG ATGAGCTCTG CTACCGCGGC
GCCGATCACG CCAGGGTGAA ACTCTTCTGT ACTGAGGTCG AAGACGGCAA ACAGGGAGAT
ACCAGAGCAG TCGTGAATGG CGATCTGGTG ATCGGTAGCT ACAGCTTCAG CCACAATGGC
AAGAAGTTGG CATTTAGCCA TAATGGGCTG GACCACTTCT CAGAGATGTT CATTGCCGAT
GCCAATAGCA AGCGCGCCAA GTTTAAGCGC ATCACCAATA TCAATCCTCA GGTCGATAGC
TGGATACTAC CGCAAATATC CGTCGTAAAA TGGAAGGCCC CCGATGGCAG CACAGTCGAA
GGTATCTTAG ATCTTCCGGC GGGATATAAG AAAGAGGATG GCCCACTGCC ATTAATCGTG
CAGATCCATG GCGGTCCTAC CTCCGCTACA CCTTATGCCC TGCAACACAG ATCCTACGGC
CGCTCTACAT TTACAGCAAA TGGCTGGGCT CTGCTGTCAC CCAACTATCG TGGCTCGACC
GGTTATGGCG ATAAGTTCCT CACCGAACTC GTGGGTCAGG AACATGTCAT CGAGGTCAAT
GACATCATGG CCGGTGTCGA CCATCTGATC GACGAAGGTA TTGTCGATGG CGATAAGATG
GCCGTCATGG GATGGAGCAA CGGCGGCTAT CTCACTAATG CTTTGATCAG TACCAATGAG
CGCTTCAAGG CGGCAAGTTC CGGCGCCGGA GTGTTCGATC AACGCCTGCA GTGGATGCTG
GAAGATACAC CTGGCCACGT GGTCAACTTC ATGGAAGGTC TCCCCTGGGA GAAGCCAGAT
GCTTACACTC ATGGCTCTTC ACTCACCCAC GCAGATAAGA TTAAAACACC AACGCTTATC
CATATAGGTG AGAATGATCA ACGCGTTCCA GTCGGACATG CTCAGGGTCT ATATCGTGCA
CTCAAACATT ACCTCAATGT ACCGGTAGAG TTGATCGTTT ATCCGGGCGA AGGTCATGGA
CTGAGTAAGT ACCAACACAG AAAAGCTAAG ATGGAGTGGG ACCAGAAATG GTTCAACCAC
TACGTCCTCG GCAAAGCTAT CGATTAA
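
The GC content field of this record can in principle be recomputed from the gene sequence. The following is a minimal sketch with a hypothetical helper name; it assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and it ignores any character other than A, C, G and T.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases from the gene sequence: six G, no C.
print(gc_percent("ATGGGAAGGG"))  # prints 60.0
```

Passing the full gene sequence to `gc_percent` and rounding the result is one way to compare against the reported value.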
 
Protein sequence
MGRANKLTVL AMISAGLFLG GSANALASQR THQITTDDFF DIGGMSSVQL SPDGKQAVWL 
ETRWDKELDK SQRDLWQVNT KNKKTTRLTF TNESESSPQW SGDGQYIYYI GKVTHEDKKA
PYNGKSQVFR INRDGGDAVP VTKEVEGVSS FSLSPDGNTL YFLGSKTVKD KDEWASMRAG
HSAPKYAHGE RKTNPLYKLD LQHFKQQLIL DDDKVVWNFS VSDDGSKVAR ITTSDNELVN
LEGWSDIEIY NTRSKSNEVL EDTAWREQAP SPYGWLLGLD WHSDNQQLAF RIDFDGHPGK
LFVSNTKTKS LTEVTRIDDV TLNSSDIHWR PNSDELCYRG ADHARVKLFC TEVEDGKQGD
TRAVVNGDLV IGSYSFSHNG KKLAFSHNGL DHFSEMFIAD ANSKRAKFKR ITNINPQVDS
WILPQISVVK WKAPDGSTVE GILDLPAGYK KEDGPLPLIV QIHGGPTSAT PYALQHRSYG
RSTFTANGWA LLSPNYRGST GYGDKFLTEL VGQEHVIEVN DIMAGVDHLI DEGIVDGDKM
AVMGWSNGGY LTNALISTNE RFKAASSGAG VFDQRLQWML EDTPGHVVNF MEGLPWEKPD
AYTHGSSLTH ADKIKTPTLI HIGENDQRVP VGHAQGLYRA LKHYLNVPVE LIVYPGEGHG
LSKYQHRKAK MEWDQKWFNH YVLGKAID
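
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough illustration, the sketch below builds the codon table in the compact NCBI ordering (bases T, C, A, G) and translates up to the first stop codon. The names are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; this sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove block spacing and line breaks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGGAAGGG CAAATAAACT CACCGTGTTG"))  # prints MGRANKLTVL
```

The same function applied to the last nine bases of the gene, `ATCGATTAA`, returns `ID`, matching the end of the protein sequence.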