Gene Sbal223_1904 details


Gene Information

Locus tag: Sbal223_1904
Symbol:
ID: 7090071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 2246018
End bp: 2248063
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 46%
IMG OID: 643460808
Product: carboxy-terminal protease
Protein accession: YP_002357832
Protein GI: 217973081
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
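
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the listed gene length). The variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 2246018
end_bp = 2248063

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is three bases. The final codon is a stop codon
# and does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under that assumption the result agrees with the listed 2046 bp gene length and 681 aa protein length, and the remainder of zero confirms the gene is a whole number of codons.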


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.239407
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.209201
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAAAC TCACTTTGGC TACCTCGATC GCCACTGTTT TTGTCGGATT CTCGGCTTGG 
GCCGTACCAC CCACGATTCA AATCAGCGAG TTACCCACTC TCAAGCAGGA AGCGCAACAT
AAGGTCGCGA GCAAAAGGGT AACGGACTTA TACACTCGTT CGCATTATCA TAGATTTAAT
CTTGATGATG CCTTTTCTAC GCAAATTTTT GACCGTTACC TGCAGCAGTT AGACTATCGC
CGCAATGTGT TAACCCAAGC GGATGTCGAT AGCTTTAAAC CTTATGCTAC CCAATTCGAT
GACATGCTCA GTTCTGGTGA ACTTGAGCCT GCCTATAAAA TGTTTGATAT CGTCCAAAAG
CGACGTTATG AAGGCTTTGT CTATGCTTTA TCCCTACTGG ATAAAGAGAT GGACTTCTCC
GCCCCCGGCG ATGCCTATGA GTATGACCGT GACGATGCTG CATGGCCAAA AGATCAAACC
GAGATCAATG AGTTATGGCG TCAACGTGTT AAATACGATG CGCTAAATCT GAAGCTCACA
GGTAAAAAAT GGCCTGAGAT TGTTGAGATC TTGCAAAAGC GTTACAACAA CGCCATCAAA
CGTTTGACTC AAACCAACAG TGAAGACGTG TTCCAAGGCG TGATGAATGC CTTCTCACGC
AGCATTGAGC CGCATACCAG TTACCTGTCA CCACGCAATG CTGAACGTTT CCAAATGGAA
ATGAATTTAA GCCTTGAAGG AATTGGTGCT CAACTGCAAC TTGAAGACGA TTACACTGTT
ATTAAAAGCT TAATTGCTGG CGGCCCTGCG GCTGGCAGCG AAAAACTGTC GCCGGAAGAC
AAGATTGTCG GCGTCGGTCA AGAAGGCGGT GAAATTGTTG ATGTGATCGG CTGGCGTTTA
GACGACGTGG TTGATCTGAT AAAAGGGCCT AAGGGCAGTA AAGTCGTACT GCAAATTTTA
CCGAAGAAAG GCGGTTCGAA TGCTAAGCCA TTCGACGTTA CCTTAGTACG CGATAAAATT
CGTCTTGAAG ACAGAGCGGC AACCTCTAAG GTGATTGAAC CTAAAGATGG CGAGTACGCT
AATCGCAAAG TCGGTGTGAT CCATATTCCT GGTTTCTACA TGAATCTGTC ACAGGACGTT
GAAAAAGAGC TAGTGAAGTT AAACGAAGCC AAAGTTGAAG GCATAGTCAT CGACCTTAGA
GGTAACGGTG GTGGTGCATT AACAGAAGCA GTGTTGCTGA CGGGACTCTT TATCGATATG
GGCCCTGTGG TGCAAATTCG TGATGCCGAT GGCCGTGTGT CTGCGCACCG TGATAACGAT
GGCAAAACCA GTTATGCAGG CCCGTTGACT GTGATGGTTG ACCGTTACAG TGCTTCAGCC
TCTGAGATTT TTGCTGCCGC ACTACAAGAC TATGACCGCG CGCTGATTGT TGGTGAGTCT
AGCTTTGGTA AAGGTACAGT GCAGCAGCAT AAGAGCTTAG GCCGTATCTA TGATATGTAT
GAAAAGCCAA TTGGTCATGT GCAATACACG ATTCAAAAGT TCTATCGGAT CAATGGCGGC
AGCACACAGC TTAAAGGTGT GACACCAAAC ATTGCTTACC CAAGTGCGTT AGAGCCAGGT
GAGTACGGCG AAGCTGAAGA GAAAAATGCC TTGCCTTGGG ATAAAGTGCC AATGGCGCAA
TATGGCACGC TCAATGATGT GACGCCTGAG TTAGTCACTA ACCTTGAGGC CAAGCACCTT
AACCGCATTA AAAGCAGTGT AGAGTTTGCT TATATCAATC AAGATATTGC CGACTTTAAA
AAGCACCACA AAGAGAAGAC AGTTTCTCTC GTTGAAAGTG AGCGCATTGC CTCCCGCGAA
GCCGATGAGA AGAAAGTCCT CGATAGAACG AACGAGCGCC GAGTCGCCAA TGGTTTAGCA
CCGGTTAAAT CAATGGAAGA CATTAAAGAC GACGCCGAAT TGCCAGACGC ATTTTTAGAT
GAAACGGCTT ATATCACTTT AGATATGGCT GATGCGCAAA AACTGGCTAA AACTAGTGCT
AAGTAG
 
Protein sequence
MRKLTLATSI ATVFVGFSAW AVPPTIQISE LPTLKQEAQH KVASKRVTDL YTRSHYHRFN 
LDDAFSTQIF DRYLQQLDYR RNVLTQADVD SFKPYATQFD DMLSSGELEP AYKMFDIVQK
RRYEGFVYAL SLLDKEMDFS APGDAYEYDR DDAAWPKDQT EINELWRQRV KYDALNLKLT
GKKWPEIVEI LQKRYNNAIK RLTQTNSEDV FQGVMNAFSR SIEPHTSYLS PRNAERFQME
MNLSLEGIGA QLQLEDDYTV IKSLIAGGPA AGSEKLSPED KIVGVGQEGG EIVDVIGWRL
DDVVDLIKGP KGSKVVLQIL PKKGGSNAKP FDVTLVRDKI RLEDRAATSK VIEPKDGEYA
NRKVGVIHIP GFYMNLSQDV EKELVKLNEA KVEGIVIDLR GNGGGALTEA VLLTGLFIDM
GPVVQIRDAD GRVSAHRDND GKTSYAGPLT VMVDRYSASA SEIFAAALQD YDRALIVGES
SFGKGTVQQH KSLGRIYDMY EKPIGHVQYT IQKFYRINGG STQLKGVTPN IAYPSALEPG
EYGEAEEKNA LPWDKVPMAQ YGTLNDVTPE LVTNLEAKHL NRIKSSVEFA YINQDIADFK
KHHKEKTVSL VESERIASRE ADEKKVLDRT NERRVANGLA PVKSMEDIKD DAELPDAFLD
ETAYITLDMA DAQKLAKTSA K
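
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own method. It builds the codon table from the standard compact amino-acid string; translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. The `translate` function and the two test fragments are illustrative names; the fragments are copied from the start and end of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence (both in frame).
first_block = "ATGCGAAAAC TCACTTTGGC TACCTCGATC"
last_block = "GATGCGCAAA AACTGGCTAA AACTAGTGCT AAGTAG"

print(translate(first_block))
print(translate(last_block))
```

The two fragments translate to MRKLTLATSI and DAQKLAKTSAK, which match the first ten and last eleven residues of the protein sequence listed above.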