Gene Sama_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0439 
Symbol 
ID4602694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp539199 
End bp542654 
Gene Length3456 bp 
Protein Length1151 aa 
Translation table11 
GC content55% 
IMG OID639779775 
Productsubtilisin-like serine protease-like protein 
Protein accessionYP_926319 
Protein GI119773579 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0564398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGC TGCACCCACG CCTGAGCCGA TTGACTTTGG CCATGCTCTC AACTTCCCTG 
ATGGCTGTTA CCGCTCCGGC GCTGGCCGCC AAGAAGGTTG AACCCAAACA AGACTTCGAC
AATTCCTCTG TTATCGTTAA ATTCAAAGAA ACCGCAAAGA AAGCGGACCG CAAGCAGTTG
CTTGCTCAGT ACGGTGTGTC ATTTAAAGAC AAAAATGACG ACGGTGTGGA TGACCGTTTC
CGCAATATCG CCAAGGGACG TCTGGCTGAA CTCACGGTGC CCCGGGGGCT GGATGCCCGT
CTTATGGTGG AGCGTCTCAA GCACAATCCC CACATTGAAT ACGCCGAACT CAACCACAGA
TTCTATCCCT CAGTGGTGCC AAACGACCCA AGCTACAGCC AGCTCTGGGG CATGCCAAAA
ATCCATGCCG AGCAGGCGTG GGAAATGGAA ATGGGCTCAC GAGAGATAGT GGTTGGTGTG
ATTGACACTG GTTTTGACTA CAACCATCCG GATCTGCGCG ACAACATTTG GGTTAACCCC
AACGAAGTGC CCAACAACGG GATCGACGAT GATGGCAACG GCTATATCGA CGATATGCAC
GGTATTTCGG CTATCAATGA CAACGGTAAT CCTCAGGACA CTCACTATCA TGGTACTCAC
GTTGCAGGCA CCATAGGCGC CACCGGTAAC AATGGCACCG GTGTGGTTGG TGTTAACTGG
AACACCGCCA TGGTGGGCTG TTCCTTCCTT GGCAGTCAGG GTGGCACCAC CGCCGATGGT
ATCCAGTGTA TTGATTACAT GGTTGATTTG AAAAACCGTG GCGTGAACAT CCGCGTACTG
AACAACTCAT GGGGCGGTGG CTCCTTCAGT CAGGCGCTGG AAGATGCCAT CACGGCGGCC
AACAATGCCG ATATCCTGTT TGTGGCCGCA GCCGGTAACG ATGCCGTGGA CAACGATGTC
AACGACAGCT GGCCTGCCAA CCATGATGTG CCCAACGTGA TGTCGATTGC CTCCACCACC
CGTGATGACC AGATGTCCTA CTTCTCCCAG TGGGGCTTGA ACACTGTGGA TATGGGCGCG
CCTGGTTCTG ATGTGTACTC CACCATTCCC GGTAGCGATT ACAACACTCT GAGTGGTACT
TCCATGGCAA CGCCCCACGT GGCCGGTGCC GCGGCGCTGA TTTTGGCAGC CGATCCCTCG
CTGACCACGG CTGATGTGAA AAACATCCTG ATGGCCTCCG GTGACCCCAT TGCCGCGCTG
GAAGGCAAGA CAGTGACAGG TAAGCGCCTG AATCTGGAAG GTGCGCTGAA CATGGCTGGT
GCAGGTGGCC CAGGCTACTA CCTGCTGGTG AGCCCTGCCA GCCGCACTGT GAATCAGGAC
TCTTCAGTTA CCTTTGATAT CGATATGAAC GCCGTGGGTG GTTACAACGG CAATGCCAGC
TTCAGCGCCG ATGTGCCTGC TGGGCTGAAT GCGGCCGTGA CCTTCTCCAG CAGCACTGTG
CCTGCCGATG GTCGCACTAC CATGACAGTG TCCACCGATG CCAACACGAG TTTGGGTAAC
CACGTTATCA CCATCAATGC GGTGGACGGT GATATCCACA AGAGCATAGA TGTGAGCCTC
TTGGTGTATC CTGCCGGTAC CTTCAGCACT ACTTACAGCA ACGATAATCC CGTGGCCATT
CCCGATGACA ATGCCGATGG TGTGAGCAGT GTTATCAATG TACCCCTGAA CCTGACTCTG
ACGGATCTGG TGGTGAATGT GGATATCGCC CACACCTACA TTGGCGATTT GACCGTTACC
CTCACATCGC CAAGTGGCCG TGCCGTCACC CTGCATAACC GCACCGGTGG CAGCGCCGAT
AATCTGGTGG CCAGCTACTC AGTGGAAGAC TTTGATCTGG AAGATGCATC AGGTGATTGG
ACCTTGCATG TTGTCGACTC AGGTTTCCGT GACGTGGGTA CCCTCAACAG CTGGAGCATG
GATGTCACCG GTGGTTCTCA GCCCGGTACC AACCTGCCAC CGACCGTGAC CATCGGCGCC
AACCTGCAAA ATGCCCTGTA TCTGCCGGGC GATGTTATCA ACTTCGTGGC CGATGCGACC
GACTCTGAAG ACGGTGACGT TCGCGCCTCG CTGGTGTGGA CCTCCAGCCT GGATGGCCAG
ATTGGCACGG GTGGCAGCTT CTCCCGCTCG GATTTGAGCC AGGGTACACA CCTGATCACT
GTGGCCGCGT CTGACAGCCA GGGTGTGGTG AGCAGCCGCG AGTTCTTTCT GTACGTTGTT
AGTGACGGTA CCGTTGTTTC CTACGAGGAC ACGAACCGTC AATCTATTGT GGACCTGGGA
ACTGTAGTGG CCGAGATTGA AGTGCCCCTC GGACTGAAGA TCAAGGACAT GAGCCTGTTC
GTGGATATTC AGCACAGCTT TGCCAACGAT ATGCTCATTC ATCTGGTATC GCCCAACGGT
ACCCGCGTGG AGATTTTCGA TCGCAAAGAG TCAGGCGAAT ACTATCGTGA CTTGGTGAAA
ACCTTCTACC CAGTTGAATT TAATGGCGAA ATGGCCGCGG GCACTTGGCA GCTCGTCATC
AAGGATGAGT GGAGCAATAA CTCCGGCTGG CTGAACCGTT GGGTTTTGGG CTTTACCCAT
GATGGTGGCA CTACTACCCC AGACAACCAG GCACCAGTGG TGAGCATCAA TGCTCCCGTG
GGTGGCAGCA GTGTTATCGA AGGTGACACA GTAACCTTTG TTGGCTCTGC GATGGATGCC
GAAGACGGTG ATGTGACCAA CACTCTGGTA TGGAGTTCCG ATCTGGACGG TGTGATTGGT
AATGGTGCCT ACTTCAGCAC CAACAACCTG AGCGTGGGCC AGCACACAGT GACCGCCAGC
GCTGCCGACA GTGAGGCAGC CAGTGGTAAC GCCATGGTTA CCCTGACGGT AGAAGCCGCC
CCGGTGAATG AGTTGCCTGT GGCTGAGTTC AGCTTCCAAG TGAATCATCT GGATGTGACC
TTTACCGATG GCTCTGGTGA TGCTGATGGT TCCGTGGTTG CCTGGGCATG GGACTTGGGT
GATGGCAACA CATCATCACT GGCGAATCCA AGCCACAGCT ATGCCACCGG TGGCAGCTAT
CAGGTGACGC TGACAGTGAC CGACAACAAT GGCGCCAGTC ACAGCATCAG CAAGCAGGTC
TCTGTGAAAG CCTCCATCAG TCTTGATGCC GCTGGCAGCA CCGACGGTAA CAAGGTAAAC
ATCAGCCTGA GCTGGAGTGG TTCTACCGCC CGCAACGTTG ATGTGTATCG CGATGGCCAG
CTGATCAACA GTGTCCGTGA CCGTGGCAGC TTCAGCGACC GCTTCAACAG CAGTGAAGGC
AGCTTTGCCT ATCAGGTCTG TGAAGCTGGC AGCGATATTT GCTCCGAAGT CATCACAGTG
ACCCCTGTGT TAAACACCCG CGGTAAGGGT AAGTAA
 
Protein sequence
MRKLHPRLSR LTLAMLSTSL MAVTAPALAA KKVEPKQDFD NSSVIVKFKE TAKKADRKQL 
LAQYGVSFKD KNDDGVDDRF RNIAKGRLAE LTVPRGLDAR LMVERLKHNP HIEYAELNHR
FYPSVVPNDP SYSQLWGMPK IHAEQAWEME MGSREIVVGV IDTGFDYNHP DLRDNIWVNP
NEVPNNGIDD DGNGYIDDMH GISAINDNGN PQDTHYHGTH VAGTIGATGN NGTGVVGVNW
NTAMVGCSFL GSQGGTTADG IQCIDYMVDL KNRGVNIRVL NNSWGGGSFS QALEDAITAA
NNADILFVAA AGNDAVDNDV NDSWPANHDV PNVMSIASTT RDDQMSYFSQ WGLNTVDMGA
PGSDVYSTIP GSDYNTLSGT SMATPHVAGA AALILAADPS LTTADVKNIL MASGDPIAAL
EGKTVTGKRL NLEGALNMAG AGGPGYYLLV SPASRTVNQD SSVTFDIDMN AVGGYNGNAS
FSADVPAGLN AAVTFSSSTV PADGRTTMTV STDANTSLGN HVITINAVDG DIHKSIDVSL
LVYPAGTFST TYSNDNPVAI PDDNADGVSS VINVPLNLTL TDLVVNVDIA HTYIGDLTVT
LTSPSGRAVT LHNRTGGSAD NLVASYSVED FDLEDASGDW TLHVVDSGFR DVGTLNSWSM
DVTGGSQPGT NLPPTVTIGA NLQNALYLPG DVINFVADAT DSEDGDVRAS LVWTSSLDGQ
IGTGGSFSRS DLSQGTHLIT VAASDSQGVV SSREFFLYVV SDGTVVSYED TNRQSIVDLG
TVVAEIEVPL GLKIKDMSLF VDIQHSFAND MLIHLVSPNG TRVEIFDRKE SGEYYRDLVK
TFYPVEFNGE MAAGTWQLVI KDEWSNNSGW LNRWVLGFTH DGGTTTPDNQ APVVSINAPV
GGSSVIEGDT VTFVGSAMDA EDGDVTNTLV WSSDLDGVIG NGAYFSTNNL SVGQHTVTAS
AADSEAASGN AMVTLTVEAA PVNELPVAEF SFQVNHLDVT FTDGSGDADG SVVAWAWDLG
DGNTSSLANP SHSYATGGSY QVTLTVTDNN GASHSISKQV SVKASISLDA AGSTDGNKVN
ISLSWSGSTA RNVDVYRDGQ LINSVRDRGS FSDRFNSSEG SFAYQVCEAG SDICSEVITV
TPVLNTRGKG K