Gene Sbal195_0456 details

Gene Information

Locus tag: Sbal195_0456
Symbol:
ID: 5752173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 522977
End bp: 525313
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 48%
IMG OID: 641286720
Product: peptidase M4 thermolysin
Protein accession: YP_001552896
Protein GI: 160873580
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
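
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 522977
end_bp = 525313
gene_length_bp = 2337
protein_length_aa = 778


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values against the values listed in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions both comparisons agree with the listed values: 525313 - 522977 + 1 = 2337 bp, and 2337 / 3 - 1 = 778 aa.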


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTAA ACAAAAAACT GTCGCTACTC TTCATTGCTG TGTCGGCCAC GTTAGGCACA 
TCAGCATTTG CTGCTGATGT TGTCGATGTC GGCACACTCA GAAGCGCGTC TTCAAGCAAT
AACTTAGTGT CTCAGTTCAA CTTAGACGCA GGTAGCCAAC TCAAAGTCGA GAAAAAACTC
GACCTAGGTC AAGGCAAGCA AAAGCAACGT TTACAACAAT ATTTCCATGA CGTGCCTGTT
TATGGTTTCT CTGTTGCAAC GTCACAATCG AGCATGGGCT TCTACAGTGA CATGTCTGGC
CGCGTATTAA AAAATATTGA AAAGTCGGCT GATTTTGTCA AACCCACTTT AACGGCTAAC
AAAGCGTTAG ACATTGCCAT TCGTGGCAAG TCAGAGAAAG CTGTTGCCGG CCTAAAAGCT
GAAAACAAAC AAGCTAAATT ATGGCTTTAC CTGGATGATG CGGCTAAAAC TCGCCTCGTT
TATGTGACGT CTTTTGTGGT ATATGGCGAT GAGCCAAGCC GTCCATTCAC TATGATTGAC
GCACACTCAG GTGAAGTGCT CAAGCGTTGG GAAGGGATTA ACCACGCCGC TAGCGGTACA
GGTCCTGGCG GTAACATCAA GACTGGCCAA TACGAATACG GTACGGATTT TTCTTACTTA
GACGTTGAAG TGAGTGGTGA CACCTGCACC ATGAACTCAC CTAACGTGAA AACCGTTAAC
TTAAACGGTG CCACATCAGG CGCAACAGCC TTCTCTTATA CTTGCCCACG CAATACCGTT
AAAGAGATCA ACGGTGCTTA CTCGCCATTA AACGATGCGC ACTATTTTGG TAACGTGATC
TACAACATGT ATAGCGAGTG GTACAACACT GCGCCGTTAA CTTTCCAGTT AACCATGCGG
GTTCACTACA GCAGCAACTA TGAAAACGCC TTCTGGGACG GCAGTGCTAT GACCTTCGGT
GATGGCGCAA CGACCTTCTA TCCATTAGTG AGCTTAGATG TCTCGGCACA CGAAGTGAGC
CATGGTTTCA CTGAGCAAAA CTCAGGCCTG ATTTACGATG CTCAATCTGG TGGCATGAAC
GAAGCCTTCT CTGATATGGC GGGTGAAGCT GCTGAATTTT ACATGCACGG CACGAACGAC
TGGTTAGTCG GTGCAGATAT CTTTAAAGGC AATGGCGCAC TGCGTTACAT GGCAGATCCA
ACCTTAGACG GTATCTCAAT CGGCCATATC GACGATTACT ATGATGGTAT CGACGTACAC
CACAGTTCAG GTGTCTTCAA CAAAGCCTTC TACACACTCG CAAACTTACC GGGCTGGGAT
ACACGCACTG CATTCCAAAC ATTTGTGGTT GCTAACCAAT TATATTGGAC GGCTGACAGC
TTGTTCTGGC AAGGCGCTTG TGGCGTTAAA TCTGCAGCGA CTGACTTAGG CTTAAGTGCT
GACGATGTGG TTACTGCGTT TGCAGCTGTG GGTATCACAC CTTGTGAAAC GCCACCACCA
CCGCCACCAC CAGAAGCCAC TGAGCTCGCG AACGGTGTCG CTGTGACGGA TTTAGCGGGC
GCATCTGGTA GCAAACAATA CTACAAACTC GACGTAGCAA GCGGTGCAAC CAACCTGAGT
TTTAATATGT CAGGCGGCAC GGGTGATGCG GACATGTATG TTAAATTCGG CCAAGCTCCT
ACCTCTAGTA GCTATGATTG CCGTCCATAC AAAGCCGGTA ATGCCGAAAG CTGCCCTATC
GATCCAGCCC AAACAGGAAC TTATTGGATC ATGGTCAGTG GCTATAGCAG CTACACAGGT
GTGAGCCTAG TGGGTGCTTA TGACGGTGGC GATGAGATAC CAAACCAAGA TCCAACAGCG
GGCTTTACGG CAAGCTTTGC CAACGGTAAC GGTAGCTTCA CTAGCACCAG TACTGACAGC
GACGGTGATG TCGTTGCATG GAGCTGGAGT TTCGGTGATG GCACAACTGC TTCAGGCGCA
AGTGTAAATC ATCAGTATGC ACAGTCAGGC AACTACACAG TGACACTGAC AGTGACTGAC
AACGATGGCG CAACAGCAAT CACTTCTGAA GAATTTGCGG TTGAAGTACC AGAAGTGGCG
TTAGAAATGA CAGTGAAAAA CGCTAACAAA TCTCGTCGCG GTAGCATTCG TGTTGCACTC
GCATGGGAAG GCCATAACGC TGATGAGTAC ACTATCTTCC GTAACGGCGT TGCGGTTAGT
ACTTCAAGCA ATTCATCGTT TATTGACCGT TTCACTGACC TAGCTGGCAC TAGCTTCAGC
TATAAAGTGT GTGAAACTAA CGGCCCTTGC TCAAACGAAG AAACCGTTAA CTTTTAA
 
Protein sequence
MMLNKKLSLL FIAVSATLGT SAFAADVVDV GTLRSASSSN NLVSQFNLDA GSQLKVEKKL 
DLGQGKQKQR LQQYFHDVPV YGFSVATSQS SMGFYSDMSG RVLKNIEKSA DFVKPTLTAN
KALDIAIRGK SEKAVAGLKA ENKQAKLWLY LDDAAKTRLV YVTSFVVYGD EPSRPFTMID
AHSGEVLKRW EGINHAASGT GPGGNIKTGQ YEYGTDFSYL DVEVSGDTCT MNSPNVKTVN
LNGATSGATA FSYTCPRNTV KEINGAYSPL NDAHYFGNVI YNMYSEWYNT APLTFQLTMR
VHYSSNYENA FWDGSAMTFG DGATTFYPLV SLDVSAHEVS HGFTEQNSGL IYDAQSGGMN
EAFSDMAGEA AEFYMHGTND WLVGADIFKG NGALRYMADP TLDGISIGHI DDYYDGIDVH
HSSGVFNKAF YTLANLPGWD TRTAFQTFVV ANQLYWTADS LFWQGACGVK SAATDLGLSA
DDVVTAFAAV GITPCETPPP PPPPEATELA NGVAVTDLAG ASGSKQYYKL DVASGATNLS
FNMSGGTGDA DMYVKFGQAP TSSSYDCRPY KAGNAESCPI DPAQTGTYWI MVSGYSSYTG
VSLVGAYDGG DEIPNQDPTA GFTASFANGN GSFTSTSTDS DGDVVAWSWS FGDGTTASGA
SVNHQYAQSG NYTVTLTVTD NDGATAITSE EFAVEVPEVA LEMTVKNANK SRRGSIRVAL
AWEGHNADEY TIFRNGVAVS TSSNSSFIDR FTDLAGTSFS YKVCETNGPC SNEETVNF