Gene Sbal_3874 details

Gene Information

Locus tag: Sbal_3874
Symbol:
ID: 4843492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS155
Kingdom: Bacteria
Replicon accession: NC_009052
Strand:
Start bp: 4550058
End bp: 4552394
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 48%
IMG OID: 640121138
Product: peptidase M4 thermolysin
Protein accession: YP_001052214
Protein GI: 126176065
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
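
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, and it rests on two assumptions: that Start bp and End bp are 1-based inclusive coordinates, and that the final codon is a stop codon that is not counted in Protein Length. The function name check_record is hypothetical.

```python
def check_record(start_bp: int, end_bp: int,
                 gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinate and length fields agree.

    Assumption: coordinates are 1-based and inclusive.
    Assumption: the last codon is a stop codon and is excluded
    from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in bp
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be whole codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa


# Values copied from the record above.
print(check_record(4550058, 4552394, 2337, 778))  # True
```

Under those assumptions the fields are consistent: 4552394 - 4550058 + 1 = 2337 bp, which is 779 codons, or 778 residues plus one stop codon.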


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCTAA ACAAAAAACT GTCGCTACTC TTCATTGCTG TGTCGGCCAC GTTAGGCACA 
TCAGCATTTG CTGCTGATGT TGTCGATGTC GGCACACTTA GAAGCGCGTC TTCAAGCAAT
AACTTAGTGT CTCAGTTCAA CTTAGACGCA GGTAGCCAAC TCAAAGTCGA GAAAAAACTC
GACCTAGGTC AAGGCAAGCA GAAACAACGT TTACAACAAT ATTTCCATGA CGTGCCTGTT
TATGGTTTCT CTGTTGCAAC GTCACAATCG AGCATGGGCT TCTACAGCGA CATGTCTGGC
CGCGTATTAA AAAATATTGA AAAGTCGGCT GATTTTGTCA AACCCACTTT AACGGCTAAC
AAAGCCTTAG ACATTGCCAT TCGTGGCAAG TCAGAGAAAG CTGTTGCCGG CCTAAAAGCT
GAAAACAAAC AAGCTAAATT ATGGCTTTAC CTTGATGATG CGGCTAAAAC TCGCCTCGTT
TATGTGACGT CTTTTGTGGT ATATGGCGAT GAGCCAAGCC GTCCATTCAC TATGATTGAC
GCGCACTCAG GTGAAGTGCT CAAGCGTTGG GAAGGGATTA ACCACGCCGC GAGCGGTACA
GGTCCTGGCG GTAACATCAA GACTGGCCAA TACGAATACG GTACGGATTT TTCTTACTTA
GACGTTGAAG TGAGCGGTGA CACCTGCACC ATGAACTCAC CTAACGTGAA AACCGTTAAC
TTAAACGGTG CCACATCAGG CGCAACAGCC TTCTCTTATA CTTGCCCACG CAATACAGTT
AAAGAGATCA ACGGTGCTTA CTCGCCATTA AACGATGCGC ACTATTTTGG TAACGTGATC
TACAACATGT ATAGCGAGTG GTACAACACT GCGCCGTTAA CTTTCCAGTT AACCATGCGG
GTTCACTACA GCAGCAACTA CGAAAATGCC TTCTGGGACG GCAGTGCTAT GACCTTCGGT
GATGGCGCAA CGACCTTCTA CCCATTAGTG AGCTTAGATG TCTCGGCACA CGAAGTGAGC
CATGGTTTCA CTGAGCAAAA CTCAGGCCTG ATTTACGATG CTCAATCTGG TGGCATGAAC
GAAGCCTTCT CAGATATGGC GGGTGAAGCT GCTGAATTTT ACATGCACGG CACGAACGAC
TGGTTAGTCG GCGCTGATAT CTTTAAAGGC AATGGCGCAC TGCGTTACAT GGCAGATCCA
ACCTTAGACG GAATCTCAAT CGGCCATATC GACGATTACT ATGATGGCAT CGACGTACAC
CACAGTTCAG GTGTCTTCAA CAAAGCCTTC TACACACTCG CAAACTTACC GGGCTGGGAT
ACACGCACTG CATTCCAAAC ATTTGTGGTC GCTAACCAAT TATATTGGAC AGCTGACAGC
TTGTTCTGGC AAGGCGCTTG TGGCGTTAAA TCTGCGGCGA CTGACTTAGG CTTAAGTGCT
GATGATGTGG TTACTGCGTT TGCTGCAGTA GGTATCACAC CTTGTGAAAC GCCACCACCA
CCGCCACCAC CAGAAGCCAC TGAGCTCGCG AACGGTGTCG CTGTGACGGA TTTAGCGGGC
GCATCTGGTA GCAAACAATA CTACAAACTC GACGTAGCAA GCGGTGCAAC CAACCTGAGC
TTTAATATGT CAGGCGGCAC GGGTGATGCG GACATGTATG TTAAATTCGG CCAAGCTCCT
ACCTCTAGTA GCTATGATTG CCGTCCATAC AAAGCCGGTA ATGCCGAAAG CTGCCCTATC
GATCCAGCCC AAACAGGAAC TTATTGGGTC ATGGTCAGTG GCTATAGCAG CTACACAGGT
GTGAGCCTAG TGGGTGCTTA TGACGGTGGC GATGAGATAC CAAACCAAGA TCCTACAGCG
GGCTTTACGG CAAGCTTTGC CAACGGTAAC GGTAGCTTCA CTAGCACCAG TACTGACAGC
GATGGTGATG TCGTTGCATG GAGCTGGAGT TTCGGTGATG GCACGACTGC TTCAGGTGCA
AGTGTAAATC ATCAGTATGC TCAGTCAGGC AACTACACAG TGACATTGAC AGTGACTGAC
AACGATGGCG CAACAGCAAG CACTTCTGAA GAATTTGCAG TTGAAGTGCC AGAAGTGGCG
TTAGAAATGA CAGTGAAAAA CGCTAACAAA TCTCGTCGCG GCAGCATTCG TGTTGCACTC
GCATGGGAAG GTCATAACGC TGATGAGTAC ACTATCTTCC GTAACGGCGT TGCGGTAGGT
ACTTCAAGCA ATTCGTCGTT TATTGACCGT TTCACTGACC TAGCTGGCAC TAGCTTCAGC
TATAAAGTGT GTGAAACTAA CGGCCCTTGC TCAAACGAAG AAACCGTTAA CTTTTAA
 
Protein sequence
MMLNKKLSLL FIAVSATLGT SAFAADVVDV GTLRSASSSN NLVSQFNLDA GSQLKVEKKL 
DLGQGKQKQR LQQYFHDVPV YGFSVATSQS SMGFYSDMSG RVLKNIEKSA DFVKPTLTAN
KALDIAIRGK SEKAVAGLKA ENKQAKLWLY LDDAAKTRLV YVTSFVVYGD EPSRPFTMID
AHSGEVLKRW EGINHAASGT GPGGNIKTGQ YEYGTDFSYL DVEVSGDTCT MNSPNVKTVN
LNGATSGATA FSYTCPRNTV KEINGAYSPL NDAHYFGNVI YNMYSEWYNT APLTFQLTMR
VHYSSNYENA FWDGSAMTFG DGATTFYPLV SLDVSAHEVS HGFTEQNSGL IYDAQSGGMN
EAFSDMAGEA AEFYMHGTND WLVGADIFKG NGALRYMADP TLDGISIGHI DDYYDGIDVH
HSSGVFNKAF YTLANLPGWD TRTAFQTFVV ANQLYWTADS LFWQGACGVK SAATDLGLSA
DDVVTAFAAV GITPCETPPP PPPPEATELA NGVAVTDLAG ASGSKQYYKL DVASGATNLS
FNMSGGTGDA DMYVKFGQAP TSSSYDCRPY KAGNAESCPI DPAQTGTYWV MVSGYSSYTG
VSLVGAYDGG DEIPNQDPTA GFTASFANGN GSFTSTSTDS DGDVVAWSWS FGDGTTASGA
SVNHQYAQSG NYTVTLTVTD NDGATASTSE EFAVEVPEVA LEMTVKNANK SRRGSIRVAL
AWEGHNADEY TIFRNGVAVG TSSNSSFIDR FTDLAGTSFS YKVCETNGPC SNEETVNF
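
The gene and protein sequences can be compared by translating the nucleotide sequence codon by codon. The sketch below is illustrative only and is not part of the original record. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons), and it ignores the spaces that separate the 10-base blocks. The names CODON_TABLE and translate are hypothetical.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the gaps between 10-base blocks) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":             # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATGCTAA ACAAAAAACT GTCGCTACTC"))  # MMLNKKLSLL
```

The output matches the first ten residues of the protein sequence. Applied to the final twelve bases of the gene (GTTAACTTTTAA), the same function returns VNF, which matches the end of the protein and confirms that the last codon, TAA, is a stop.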