Gene Sbal223_3514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3514 
Symbol 
ID7088627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4175527 
End bp4177443 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content48% 
IMG OID643462398 
Productpeptidase U32 
Protein accessionYP_002359419 
Protein GI217974668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.163924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC CAGAGATATA CACTCCCGAG AATCTTTCTC ACGTCCATAA CCGCCTTGAG 
TTATTGGCGC CTGCAAAAAA TGCTGACTAT GGTATCGAAG CCATACGCCA TGGTGCTGAT
GCGGTATACA TAGGGGGGCC CGCATTCGGT GCGCGGGCAA CGGCAGGTAA CAGTGTTGAA
GATATCGCGC GACTCTGTAC TTATGCTCAT CAATATCATG CTCAGGTATT CGTCGCCCTC
AATACTATTT TGATGGATAA CGAACTGGCC GACGCCGAAA AACTCATTTG GCAATTGTAC
GAAGCCGGTG CCGATGCACT GATTGTGCAG GACATGGGTG TATTGCAACT CAATTTGCCG
CCGATTGCCC TGCATGCCAG TACCCAAATG GATAACCGCA GCCCAGAAAA AGTGGCGTTT
TTAGAGCAAG TTGGTTTTTC ACAGGTGGTG CTCGCCCGTG AATTAGGCTT GAGCCAAATC
CGTGATGTGG CCGCGCATAC TAACATGCAG CTCGAATTCT TTATCCACGG CGCTCTGTGT
GTGGCTTACA GTGGTTTGTG TAACTTGAGT CATTCCTTTA GTAACCGCAG TGCTAACCGC
GGTGAATGCT CGCAAATGTG CCGCCTGCCG GGCAATCTTA AAACCCGTCA AGGCGATGTG
TTAGCGCAGA ATGAACATTT ATTGTCTTTA AAAGACAATA ACCAAACCGA AAACCTTGAA
GCGCTCATCG ATGCGGGCAT TCGCTCCTTT AAGATTGAAG GTCGCTTAAA AGATTTAAGC
TACGTTAAAA ACGTCACCGC GCATTATCGC CAAAAGCTAG ATGCGATTAT GGCGCGTCGC
CCTGAGTTTA AGGCCTCGTC CCACGGCCGC TGTGAACATT CCTTCATCCC CGATCCTGAA
AAGACCTTTA ACCGCGGCAG CACAGATTAC TTTGTTAACG AACGTAGCCA AGGCATTAAA
GATTTCCGTT CGCCAAAATA CATAGGCGAA GAAGTCGGTA AAGTGGTCAG CATAGGCAAA
GACTTTATCC AAGTGAGCTC AACCCATGAG TTTAATAACG GCGATGGCTT AGCGTTCTTC
CCTGCGAATT TTGCCATGGC AAAACAATCG GATGACAAAC TGCAAGGCCT GCGAGTTAAC
CGCGCCGAAG GCTTAAAACT GCATATATTG CAGGTGCCAA AGGATTTAAA AGTCGGCATG
ACCTTATATC GTAACCATAA TCAAGCCTTC GAAGCCCTGT TATTGAAAGA ATCTTCTAAG
CGGATTATCG GCGTCGATTT ACGTTTGAGC GACACTCAAG ACGGACTGGC GCTGACCTTG
ACCGATATCT ATGGCCTAAG CGCCACAGTC AATTTAGTGG TCGAGAAAAC ACCTGCGACA
GATGTAGACA AGACCTTGCA AAATACCCGC ACTCAGCTGG GTAAGCTCGG TAGTACCGAC
TTTGTTGCCC GCAGTATCAG TATCGATACG GCGCAAGCCT GGTTCTTGCC AGCGTCAGTG
CTCAATGGTC TGCGCCGTGA TGGCGTGGCC GCTTTAGAGG CTGCCCGTGT ACAAGGTTAT
GTGCGTCCAC TGCCGTGGAA ACATAATCAA GATGCAGTGT ATCCCGTCAA GCATTTGAGT
TACTTAGGTA ACGTAGCCAA CGAAAAGGCC AAGGATTTTT ATCAACGTCA CGGCGTGATT
GAGATCCAAG ATACCTATGA GAAAAACGGC GTCACTGAAG ACGTGCCATT GATGATCACT
AAGCATTGTC TGCGGTTTAA CTTTAATTTG TGTCCGAAGG AAGTACCAGG CATCAAGGCC
GATCCTATGG TGCTTGAAAT CGGTAACGAT GTGCTCAAGT TGGTATTCGA CTGTCCTAAA
TGTGAAATGA TGGTGGTCGG TGAAAACCGT CAAGTGCGAG GCCAGAAACA CCTTTAA
 
Protein sequence
MSQPEIYTPE NLSHVHNRLE LLAPAKNADY GIEAIRHGAD AVYIGGPAFG ARATAGNSVE 
DIARLCTYAH QYHAQVFVAL NTILMDNELA DAEKLIWQLY EAGADALIVQ DMGVLQLNLP
PIALHASTQM DNRSPEKVAF LEQVGFSQVV LARELGLSQI RDVAAHTNMQ LEFFIHGALC
VAYSGLCNLS HSFSNRSANR GECSQMCRLP GNLKTRQGDV LAQNEHLLSL KDNNQTENLE
ALIDAGIRSF KIEGRLKDLS YVKNVTAHYR QKLDAIMARR PEFKASSHGR CEHSFIPDPE
KTFNRGSTDY FVNERSQGIK DFRSPKYIGE EVGKVVSIGK DFIQVSSTHE FNNGDGLAFF
PANFAMAKQS DDKLQGLRVN RAEGLKLHIL QVPKDLKVGM TLYRNHNQAF EALLLKESSK
RIIGVDLRLS DTQDGLALTL TDIYGLSATV NLVVEKTPAT DVDKTLQNTR TQLGKLGSTD
FVARSISIDT AQAWFLPASV LNGLRRDGVA ALEAARVQGY VRPLPWKHNQ DAVYPVKHLS
YLGNVANEKA KDFYQRHGVI EIQDTYEKNG VTEDVPLMIT KHCLRFNFNL CPKEVPGIKA
DPMVLEIGND VLKLVFDCPK CEMMVVGENR QVRGQKHL