Gene Sama_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_0744 
Symbol 
ID4602997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp910321 
End bp915498 
Gene Length5178 bp 
Protein Length1725 aa 
Translation table11 
GC content55% 
IMG OID639780079 
Productserine protease 
Protein accessionYP_926622 
Protein GI119773882 
COG category 
COG ID 
TIGRFAM ID[TIGR03501] gammaproteobacterial enzyme C-terminal transmembrane domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTAA AACATCCAAT CAAAGCCAGC GCAGCTGCTG TGTTCGGTGT CCTGTACCTG 
GGGATGTCGG GCTACGCCGC TGCAGAGATT GGCATGGCGA AAGCCGACAA GGGCGGCTTC
TATGTGCCCA CCTTTACCAG CGACGATATC AAGGCATTCA ATGCGTCGCG TAAAGAAGAT
CAAACCGGCG ACCTCTTCCT CGTCCCCGGC AAAGTGAATC ACGTACTGAA TCGCCGTCAG
CACCAAGTGT TTGAGTTCGA CGACTCCATC AAGGGCGAAC ATACCTTTAT CGTGCAGTTC
GATGACAAGC CCGTCGCTAC CTACGATGGT GGTGTGACAG GCTACGCGGC CACCAAGCCT
TTGATGATGC AAAAGTCCGG TGCACTGAAT CCAGGCCAGG CTCAGGCAGC TGAAGTAGTG
CACTACCAGT CCATGCTAAG GAGCAAACAA CAGAGTGTCC TGAATCAGGC CAGTGCCCAC
GGCGCCCGCT TCGAACTGAA GAACCAGTTT ACCCTGGCCA ACAACGCCGC CACGGTTCGC
ATGACCCAGG AAGACGCCGC GCGTATGGCG CAGGTTCCCG GCGTGAAAAA GATTACCCCT
ACCCGGGTGT TCAAGTTGCG TACCGACCGT GGTCCGGAGT TTATTCATGC CGATTCAGCC
TGGAACGGCA ATACCAGCTC TGGTCTGAAG GCCCAGGGTG AAGGCATGGT CGTGGGGATT
ATCGACACAG GTGTGAATAC CGACCATCCA GCGTTTGCCT CAGATGCCGA CTTTACTGCC
AGCCATGAAA AATTGGGCGG TCAGTATCTG GGAGACTGTC AAACCGATGC CAGCCTCTGT
AACGATAAGC TGATTGGCGT TTACTCCTAC GAGGTGATTA CTGAAGTCTA TAACGCGCCC
GAGTTTCAGG ATTACTCCTG GCAGAGCAAG CTTATCCGTC CCCGCAACGG TGAAGATTAC
AACGGCCACG GCTCCCACAC CGCCAGCACC GCCGCAGGGA ACCGCATTGA AAATACACCG
CTGCAAGCGG CCAACGGCGA CAAGGTCAGC GACGGCGTAA ACTTGCCATT CAACTTCGAT
CACACCTCCG GTGTTGCGCC CCGCGCCCAC ATTATTTCTT ACCAGGTGTG CTGGCCAGGG
AGCGGTGGTG ATCCATACGC AGGTTGTCCC GAAGAAGCCA TTCTCGCCGC CTTTGAAGAT
GCCATCCGCG ACGGCGTGGA CGTGATTAAC TTCTCCATTG GTGGCGGCGA AAACTTCCCC
TGGGAAGACC CAATGGAACT GGCATTCCTG TCTGCCCGCG AAGCCGGTAT CTCAGTGGCC
GCCGCTGCCG GTAACTCTGG CCCGTACTTC TACAGTGCTG ACCATACCTC TCCCTGGGTA
ACCACTGTGG GGGCATCCAC CCACGACAGA ACCCTGGATG CGGGTAAAAC CAGTATCACG
GCCTTTGAAT CCACCGGGCC TGCCTACACC ATTCCGAAAA ATGACATCGT GGGCAAAGGC
TTTACCGAAG AAATTTCCGG TCAGTTTGTG CTGGCTGAAA ACTATCCCGA CCCCAACCCC
AATGATGGCT ATGCGGCCAA GACCTGTAAT GCACCTTTCC CGGCCGGCAC CTTCACCGCT
GACCAAATTG TCGTCTGTGA GCGTGGCGAT ATTCCCCGCG TAGATAAGGC AATCAACGTA
CAGGCAGGCG GCGCCGGTGG TTTGGTGTTG CAAAACGTCA GCTATAACGA CCCTCTGGTG
GCCGACCGTT TTGTAATCCC CGGGATCAAC GTGTCTTCCA GCGTGGGTTA CAGCCTGAAA
AACTGGATAA ACCGCAGTAA CGGCACCGCC CGCGGCACCA TCACAGCCCA TGTGAACGAC
TATCTGCTGG ACGAAGAAAA AGGCAATCTG TTGGCCTACT TCAGCTCCAT GGGCCCAAGC
CGCTATATCG ACAACCTGGT ACCGGATGTC ACTGCGCCCG GCGTAAATAT CTATGCCGCC
AACGCCGATG ACCAACCCTT TACCAATTAT CCATCGGCCA GCGACTGGAC CATGATGAGC
GGAACTTCCA TGGCCTCGCC CCATGTGGCC GGCGCCATGA CGCTGCTGAC CCAGTTGCAC
CCTGACTGGA CACCGGCGGA GATCCAATCG GCATTGATGC TGACCGCCAA CGAAGTCAAA
TACCAACCTT ATGCCGGCGC AACACCCGCT GAACTGCCAT ACCACTTCAT GGCAGGTGCC
GGTGCCATTG ATGTGGCAAA GGCCGACGCC ACTGGGCTTA TCATGGATGA AACCATTGAT
GGCTATATGG CTGCCAACCC CAATAACGGT GGCATAGTTA ACTGGCTGAA CCTGCCATCC
ATGGTTGACA TGAACTGTGA GAAAGAATGC ACCTGGATGC GCACAGTCAA GGCCACCAAA
GACGGCAGCT GGTCAGTCGG CACTGAAGTA CGGGAGGACG GTGCTACCCT GGTTGCTACG
CCGAACCAAT TCAGCCTGAA GGCCGGGGAA ACCCAAACCA TCATGGTCAA AATGACGGTA
CCGAGCATCA ATCGCTACGC GGTCGATCCG GATGACGGTG ATTCACCATG GGAAAGCAAT
ACCAACTACG CCCTCTTTAA TGGCAAGCTG ATGCTGACCG AGAGCACCGG CAACTCACCT
GAGTTGCATA TGCCTGTGGT GGCTCTGTCC AACTACGACC AACTGCCTTT TGCCAAGCAA
ATCGAGTTCA ACCGTGAACA GGGCTCAGAA ACTTTTATCG TAAATACCGA CAACTACAGC
CAGTTTACCC CAAGATACTA TGGGCTGGTT AAACCCGAAG TGGAACAGCA TGAGTTGGGT
CTGGTGAGTC CTATCATCAA TATGGCCAAC GTGGAAAAGT GGGGCCTCAG CAAGGTGGTG
GTGCCAGAAG GCACAAAGCG CCTGATGGTG GAGGTACAGT CGGCCGAGGT CATTGGCTAT
GACAACAATC AGAACCCGCG CTATATCAAA CAAGCTCCCG TGCTGACCGT GGGTCTGGAT
GCCAATGGCA ACGATGGTTT TACCCCCTCC CAGGAGGAGA TCGATGCCGA CTACTACGCC
CTCAGGAACG AGTTTTTTTC CGAGATGAAG TGTCAGTCAT CATCCTCTGC GGTGCAGAAC
TACTGTGACA TAGTGGACCC GACTCCCGGC ACCTATTGGA TTGCTCTGAT CAACGTTGGC
AGTGGCGAGC AGAAGTATAA GGTGAATACC GCCGTTGCCG TGATAGGTAA CGACAGCGCA
GCGGGCAACT TCCACCTTGA AGGTCCGGCA TCCCACGATG GCAATGGCAA CTACCAGCTG
ACCCTGAATT GGGATCTGCC GGAAGCGGCG GAAGGCGACG TGTTCTACGG TGGTTTCGAC
ATGGGTAACA TGCCTGGCGA AGAAGGCACC CTGGGCTTTA CCTCCCTCAC CCTGAGACGC
GGTAAGGACA ATGTGTCCTT TAACCTCAGC CAGGACAAGG CCCGCAATAT GGATGTGATC
GAAATCGATC TGTCTATGCT GCCCAACCTG GAAACTCAGG ACAGAGATTT CAGCTTCAAG
CTGACACTGC CCGATGGTAT GCGTTTGGCG CCTGAAACCC TCAAGACGGT TAATGACAAA
GCCCTGACCA ATCTCGAAAT GGATGAAAAG GGCTTCAGCC TCAGTGGTAA CCAGCCAAGC
ACCCGCAATA TCCAGCGGGA GTATGTGGTG ACCAACAGCC TGACGTCTGC CCAGTGCCGT
ACTCCCATCA TTGATGAGTA CTCGGACGGT GGCTATATCG ACTTGCATGA GTTCGGGATG
CAGCCCGACC AAGTCTGGCA CGTGGGAGAC CATCGTGCCT ATAACGATGT CCCTATGAAC
TGGCTGTTCT GGGGCATGGA TCAGGAGCAA TTCAAACTCT ATAACCAGGA CAACGGTGGC
TTTATCCGTA TGCACGCCGT GGGCGCCATG CAGTTCAACA GTGCCTATTG GATGATGAAC
TACGTGCGTG GTCCAGGCTT CCTGTTCGAG TCAATCAACC CCTTCTGGCG CGGCAGCTTC
GAAGCCAAGA ACCGTCGTCA CTGGGAAGAT CCCTGGGGTT TGACCATTGC GGCTCAGTAC
GATGCAGACC GTCCTGATCT CGGCGATCTG CTGTTTATGG AGTTTGATAA CGTCACAGAT
AAGCAAACCG GCGATGAATA TGACTATGAA GTGATATTGC GCCCCAATCT GGACTTCCGT
GATAATCGCT TCGAGATGAT CTTTGCCTAC GACAACCTGG GTGCAAACCT GGCGAAGGGT
ACTATTTTTG TCGAAGGCTT TGACAGCCCT TACTCTACTA ACGTGGGTCC GAAAGATGGC
TATCTCTACA CCATGGTTGG CTTTGACAAT CTGGATGAGG TGCTCGAAGA CAATATGGTG
ATGTGTTTCG ACTACCAGGG TCCCGAGCAG AGCGCCATCG ACATGAAGGT GAAGGCGGTT
GTCCAGCCTG AAGCCGTGGG TAAGACGCTG GAAATCCTGC TGACCCATAG CGTAGAAGGT
CAGGCCGAAA AAACCACCAG CCGCACTATT GTGGTCAACA GCGATCTGAA AGTCGCTGCC
ATGCCCGATA TGCAAGTGGC CGAAGACGGT GAATTGAGCG GCATCGAAGT CTTCTATCTT
GACGCCAACA AGGTCGGCAA CCACCTGCTG GTGAGTGGCG ACCATGTGAC GGCCACTGTC
GATGGCAGCA GCTTCAGCCT CAAGCCCGAT GCCGACTTCT TCGGTGAAAC CCTGGTGACT
GTGACGGTTC AGGACAACGA GCATGCAAGC GATCAGGCCA GCACCAGCTT TATGCTGACA
GTCACCCCGG AGCAGGATGC ACCGGTTGCC AAAACCGCCG AGGCTGAGAT TGCCATCACA
GAGGGTCAAA CCATTACCCT GGATGCCAGC AGCTCAGTAG ATATGGACGG TGACTCTCTG
ACATTCAGCT GGGATGGCCC GGGCACCTTC AGTGACGACA GCGCTGCTGT CACTAAGGTA
ACCGGCCTGT CAGTGGGTGA ACACAGTTTC ACCGTGACAG TGTCTGACGG TATGGATGAA
GCCGAGGCAG AGGTCATAGT GAAGGTGGCC GCAGCTCCCG TCACTGAAAC CACTCCAGCC
AACAACAGCT CAGGCGGCAG CCTGGGTTGG ATGGCCTTGC TGCTGATGGC AGCCGGAGCA
CTGCGCCGTC GCCATTAA
 
Protein sequence
MTVKHPIKAS AAAVFGVLYL GMSGYAAAEI GMAKADKGGF YVPTFTSDDI KAFNASRKED 
QTGDLFLVPG KVNHVLNRRQ HQVFEFDDSI KGEHTFIVQF DDKPVATYDG GVTGYAATKP
LMMQKSGALN PGQAQAAEVV HYQSMLRSKQ QSVLNQASAH GARFELKNQF TLANNAATVR
MTQEDAARMA QVPGVKKITP TRVFKLRTDR GPEFIHADSA WNGNTSSGLK AQGEGMVVGI
IDTGVNTDHP AFASDADFTA SHEKLGGQYL GDCQTDASLC NDKLIGVYSY EVITEVYNAP
EFQDYSWQSK LIRPRNGEDY NGHGSHTAST AAGNRIENTP LQAANGDKVS DGVNLPFNFD
HTSGVAPRAH IISYQVCWPG SGGDPYAGCP EEAILAAFED AIRDGVDVIN FSIGGGENFP
WEDPMELAFL SAREAGISVA AAAGNSGPYF YSADHTSPWV TTVGASTHDR TLDAGKTSIT
AFESTGPAYT IPKNDIVGKG FTEEISGQFV LAENYPDPNP NDGYAAKTCN APFPAGTFTA
DQIVVCERGD IPRVDKAINV QAGGAGGLVL QNVSYNDPLV ADRFVIPGIN VSSSVGYSLK
NWINRSNGTA RGTITAHVND YLLDEEKGNL LAYFSSMGPS RYIDNLVPDV TAPGVNIYAA
NADDQPFTNY PSASDWTMMS GTSMASPHVA GAMTLLTQLH PDWTPAEIQS ALMLTANEVK
YQPYAGATPA ELPYHFMAGA GAIDVAKADA TGLIMDETID GYMAANPNNG GIVNWLNLPS
MVDMNCEKEC TWMRTVKATK DGSWSVGTEV REDGATLVAT PNQFSLKAGE TQTIMVKMTV
PSINRYAVDP DDGDSPWESN TNYALFNGKL MLTESTGNSP ELHMPVVALS NYDQLPFAKQ
IEFNREQGSE TFIVNTDNYS QFTPRYYGLV KPEVEQHELG LVSPIINMAN VEKWGLSKVV
VPEGTKRLMV EVQSAEVIGY DNNQNPRYIK QAPVLTVGLD ANGNDGFTPS QEEIDADYYA
LRNEFFSEMK CQSSSSAVQN YCDIVDPTPG TYWIALINVG SGEQKYKVNT AVAVIGNDSA
AGNFHLEGPA SHDGNGNYQL TLNWDLPEAA EGDVFYGGFD MGNMPGEEGT LGFTSLTLRR
GKDNVSFNLS QDKARNMDVI EIDLSMLPNL ETQDRDFSFK LTLPDGMRLA PETLKTVNDK
ALTNLEMDEK GFSLSGNQPS TRNIQREYVV TNSLTSAQCR TPIIDEYSDG GYIDLHEFGM
QPDQVWHVGD HRAYNDVPMN WLFWGMDQEQ FKLYNQDNGG FIRMHAVGAM QFNSAYWMMN
YVRGPGFLFE SINPFWRGSF EAKNRRHWED PWGLTIAAQY DADRPDLGDL LFMEFDNVTD
KQTGDEYDYE VILRPNLDFR DNRFEMIFAY DNLGANLAKG TIFVEGFDSP YSTNVGPKDG
YLYTMVGFDN LDEVLEDNMV MCFDYQGPEQ SAIDMKVKAV VQPEAVGKTL EILLTHSVEG
QAEKTTSRTI VVNSDLKVAA MPDMQVAEDG ELSGIEVFYL DANKVGNHLL VSGDHVTATV
DGSSFSLKPD ADFFGETLVT VTVQDNEHAS DQASTSFMLT VTPEQDAPVA KTAEAEIAIT
EGQTITLDAS SSVDMDGDSL TFSWDGPGTF SDDSAAVTKV TGLSVGEHSF TVTVSDGMDE
AEAEVIVKVA AAPVTETTPA NNSSGGSLGW MALLLMAAGA LRRRH