Gene Sama_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1585 
Symbol 
ID4603837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1928153 
End bp1930198 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content52% 
IMG OID639780941 
Productcarboxy-terminal protease 
Protein accessionYP_927462 
Protein GI119774722 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00663256 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAC TATCTCTGGC GGTATCCATT GCCGGGATCC TGGTAGGATC TTCTGCGTGG 
GCCATAGCGC CTGCCATTCA AATCGATGAG CTGCCCTCTC TGGTGCAGGA GCCTCAGCAC
AAGGTGGCCT CCAAGCGGGT AGCGGATCTT TTTACCCGCT CACACTACCA CAGGTTCAGC
CTGGACGATG CCTTCTCCGA ACAGATTTTC GACCGTTACC TTAAGCAGTT GGATTACCGT
CGTAATGTGC TGACTCAAGC CGATGTGCAG AACTTCGAAA AGTATCGCCA TCAGTTTGAT
GACATGCTCA AAAACGGCGA TATGACAGGC GCCTACGACA TGTTCGATCT GGCCCAAAAG
CGCCGGTATG AAGGTTTTGT TTACGCGCTG AGTCTGCTAG ATAAAGAAAT GGACTTTTCT
CAGGCGGGTG ACAAGTACCA GTATGACCGT GAAGACGCTC CCTGGGCTAA GGACGAAGCC
GAAATTCAGG AATTATGGCG TCAGCGCGTT AAATATGATG CGTTGAACCT TAAGTTGACC
GGTAAAAACT GGACTGAAAT CGTTGATGTG CTGCAAAAGC GCTACAACAA CGCCATCAAA
CGCCTGGGCC AAACCCAGAG CGAAGATGTA TTCCAGACAG TAATGAACGC GTTTTCACGC
AGCATCGAGC CCCACACCAG CTATCTTTCC CCACGCAATG CCGAGCGTTT CCAGATGGAA
ATGAACCTCA GCCTCGAGGG CATTGGCGCT GTGTTGCAAA TGGATGACGA CTACACCGTT
ATCAAGAGCA TGGTTGCAGG TGGCCCTGCT GCCAGCAGTG AAAAATTGTC CCCCGATGAC
CGCATCATAG GTGTAGGTCA GGAAGGTGGT GCCGTGGTGG ACGTGATTGG CTGGCGTCTC
GATGACGTAG TCGACCTTAT CAAGGGCCCC AAAGGCAGCA AGGTCACCTT GCAAATTCTA
CCCAAGAAAG GTGGCTCCAA CGCCAAACCA GTCGAAGTGA CCCTGGTCAG GGATAAAATT
CGCCTTGAAG ATCGGGCTGC TACGTCCAAA GTGATTGAGC CCAACGAGGG GCAGTATGCC
AACCGTAAGG TAGGCGTTAT TCAGATCCCC GGTTTCTACA TGAACCTGTC TCAGGATGTG
GCTAAAGAGC TGCAAACCCT GAAAGAGGCG AAAGTAGAAG GTGTGATTAT TGATCTGCGT
GGCAATGGTG GTGGCGCCTT GACCGAAGCT GTGTTGCTGA CAGGGCTCTT TATCGATATG
GGGCCTGTGG TGCAGGTACG CGATGCCAAT GGCCGCGTTT CCCAGCACAG AGACAACGAT
GGCAAGGTCA CTTATTCAGG TCCCCTGACT GTGATGGTGG ACCGTTACAG TGCTTCTGCG
TCTGAAATTT TTGCGGCCGC CTTGCAGGAC TACCAGCGTG CGCTGATTGT TGGTGAGTCA
ACCTTCGGTA AGGGCACAGT TCAGCAGCAT AAGGGCCTGG CGCGCATATA CGATCTGTAT
GAAAAGCCAG TTGGCCATGT GCAGTACACA ATTCAGAAGT TTTACCGTAT CAACGGTGGC
AGCACCCAGC TTAAAGGGGT TACACCGGAT ATCCCGTTCC CCAGTGCCCT CGAGCCTGGA
GAATACGGCG AGGCTGAAGA AGATAACGCA CTGCCCTGGG ATAAGGTACC TGTGGCCCAG
TACAGCACGG TGGATGCTAT CAGTGCCCCC CTCATTGCTG AACTGGATGC CAAACATCAG
GGTCGCATCA AGTCAGATGT GGAATTTGGC TATATTTATC AGGATATTGC TGAATATAAA
AAGCACCACG ATGAAAAGTC TGTATCACTG GTGGAAAGTG AGCGCATTGC TGAGCGTGAA
GCTGATGACA AGAAACAGCT CGATCGCACC AATGAACGCC GCACCCGCGC CGGCCTCGAT
AAGGTAGCGA GCCTGGATGA TATCGAGAAA GATATCGAAG CGCCGGATCC ACTGCTGGAT
GAAACTGCCT ATATCACACT GGATCTGGTT GATGCGGGCA AACTTGCGGC AACCAGCAAG
CACTGA
 
Protein sequence
MRKLSLAVSI AGILVGSSAW AIAPAIQIDE LPSLVQEPQH KVASKRVADL FTRSHYHRFS 
LDDAFSEQIF DRYLKQLDYR RNVLTQADVQ NFEKYRHQFD DMLKNGDMTG AYDMFDLAQK
RRYEGFVYAL SLLDKEMDFS QAGDKYQYDR EDAPWAKDEA EIQELWRQRV KYDALNLKLT
GKNWTEIVDV LQKRYNNAIK RLGQTQSEDV FQTVMNAFSR SIEPHTSYLS PRNAERFQME
MNLSLEGIGA VLQMDDDYTV IKSMVAGGPA ASSEKLSPDD RIIGVGQEGG AVVDVIGWRL
DDVVDLIKGP KGSKVTLQIL PKKGGSNAKP VEVTLVRDKI RLEDRAATSK VIEPNEGQYA
NRKVGVIQIP GFYMNLSQDV AKELQTLKEA KVEGVIIDLR GNGGGALTEA VLLTGLFIDM
GPVVQVRDAN GRVSQHRDND GKVTYSGPLT VMVDRYSASA SEIFAAALQD YQRALIVGES
TFGKGTVQQH KGLARIYDLY EKPVGHVQYT IQKFYRINGG STQLKGVTPD IPFPSALEPG
EYGEAEEDNA LPWDKVPVAQ YSTVDAISAP LIAELDAKHQ GRIKSDVEFG YIYQDIAEYK
KHHDEKSVSL VESERIAERE ADDKKQLDRT NERRTRAGLD KVASLDDIEK DIEAPDPLLD
ETAYITLDLV DAGKLAATSK H