Gene Sama_0428 details


Gene Information

Locus tag: Sama_0428
Symbol:
ID: 4602683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 524819
End bp: 527146
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 57%
IMG OID: 639779764
Product: peptidase M4, thermolysin
Protein accession: YP_926308
Protein GI: 119773568
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
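
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 524819
END_BP = 527146


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2328
print(protein_length(length_bp))  # 775
```

Both results agree with the Gene Length and Protein Length fields listed in the record.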


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAAGC AACAACTCTC ACTCTTGTTT ATCGCCGTTT CGGCAACCCT GGGTACATCA 
GCCTATGCCG CCAACGTAGT GGACGTGGCC AAGATGCGCC ACGCCACTGG CGACATCGGC
AGCGTACTGC AACTGGCTCC CGGCCACGAG TTCCGCACCG CAAAACAACT GGATCTGGGC
AAAGGCCTGA AAAAAGAGCG TCTGCAGCAA TACTTCCACG GAGTGCCTGT TTATGGTTTC
ACTGTGGCCG CTGATCGCTC TGAGATGGGT TTCTACAGCA ACCTCAAAGG CCAGATGCTG
GCCAATATCG ACAAGGATGC GGCTTTTGCC AAGCCCAGCC TGAGCAAAGA CAAAGCCCTG
GAAATCGCCA AGGCCGCCAA GGGTGGCCAT GGTCTTAAGG CCGGCAAGAC CCGTAACGAC
AGCGCCCAGC CATGGATCAT CCTGTACGGC AAAGCGCAGG AGCCACGACT GGTGTATATC
ACCTCTTACG TGGTTGATGG CGATACCCCA AGCCGTCCAT TCACCATGGT AGACGCCCAC
ACAGGTGAAG TACTGGAGCG CTGGGAAGGT CTGAACCATG CCGGCACAGG TACCGGCCCC
GGCGGTAACG CCAAAACCGG CCAGTATGAG TACGGCACTG ACTTCGGCAA CCTGGATGTG
GAAGTGAATG GTGACACCTG CACCATGAAC AATGCCAACG TGAAAACCGT GAACCTGAAC
CACGGCACCA GCGGCAATAC GGCATTCAGC TACACCTGCC CACGCAACAC GGTAAAAGAA
ATCAACGGTG CCTACTCACC GCTGAACGAT GCCCACTACT TCGGTGGCGT GGTGTATGAC
ATGTACGATC AGTGGTATGG CACAGCGCCC CTGTCCTTCC AGCTGACCAT GCGCGTGCAC
TACAGCAACA ACTATGAAAA CGCCTTCTGG GACGGCAGCG CCATGACCTT CGGTGATGGC
CAGAGCTACT TCTACCCACT GGTGTCTCTG GACGTATCAG CCCACGAAGT GAGCCACGGC
TTTACCGAAC AGAACTCGGG CCTGGTGTAT GCCAACCAAT CCGGTGGTAT GAACGAAGCC
TTCTCGGACA TGGCAGGTGA AGCGGCCGAA TTCTTTATGA AAGGCAGCAA CGACTGGCTG
GTTGGCGCCG ATATCTTCAA AGGCGATGGC GCACTGCGCT ACATGGCTGA CCCAACTCAG
GACGGCAAGT CTATCGGTCA TATCGACGAC TACTACGATG GTCTGGATGT GCACTACAGC
TCAGGTGTGT TCAATAAAGC CTTCTACACC CTGGCTACCA CCGAAGGCTG GGATACCCGC
AAAGCCTTTG AAGTCTTCGT TATCGCCAAC CAATTGTACT GGACGCCCAA CAGCCTGATG
TATGAAGGTG CCTGTGGCGT GAGAAATGCC GCCACCGATA AGGGCTACAG CACTGCTGAC
GTAGATGCAG CATTCGCAGC CGTGGGGATC ACCCCCTGCG AAGTTCCACC ACCACCGCCA
CCACCAGAAG CTGAGGTACT GCAAAATGGC GTAGCCGTGA CGGACATCAG CGGCGCTTCA
GGCAGCAAGC AGTATTGGAC TCTCGATGTC CCGGCCGGTG CCAGCAATCT GGTGTTCAAC
CTGTCCGGTG GCTCGGGCGA TGCCGACCTC TATGTGAAGT TTGGCGCCAA CCCAACCAGC
ACTGACTACG ATTGCCGCCC CTATGCAGCA GGCAACAATG AAAACTGTGC CATCAGCAAT
GTCCAGCAAG GTACTTACTG GGTGATGCTG AACGGCTACA GTTCGTACAG CGGCACGACC
CTGGTAGGCA GCTTCGATGG CGGCAGCACT GAGCCAAACG AAGCCCCGGT TGCCAGCTTC
ACCGCCGACT ACACGGGTGC CCTGTACAAT TTCACCAGCA CTGCTACCGA CTCTGACGGC
AACGTGGTCG CCTGGAGCTG GGACTTCGGT GACGGTGAAA CGGCTACCGG TTCTGCTGCA
AGCCATCAGT ATCTGGTGAG TGGCAGCTAC ACTGTCACCC TGACCGTTAC CGACGATGAT
GGCGCCACCG CCAGCACCTC TGAAGTGTTC AACGTGGAAG TGCCTGCCGC CGAGATGGAT
CTGAGCATCA CCAAGGTGAA CGTGTCACGC CGTGGCAGCG CCCGTATCGG TTTGGAATGG
ACAGGTAACA GCGCCTCTTA CACTGTGCTG CGTAACGGTG AAGCCGTCGG CACCACCAGC
GCCAACAGCT ATGTTGACCG CTTCACTGCC ACTGCCGGCA GCAGCGTCAC CTTCCAGGTG
TGCAACTCCG ATGGTGGTTG CTCCGACATT GAAACCGTAA GCTTCTGA
 
Protein sequence
MRKQQLSLLF IAVSATLGTS AYAANVVDVA KMRHATGDIG SVLQLAPGHE FRTAKQLDLG 
KGLKKERLQQ YFHGVPVYGF TVAADRSEMG FYSNLKGQML ANIDKDAAFA KPSLSKDKAL
EIAKAAKGGH GLKAGKTRND SAQPWIILYG KAQEPRLVYI TSYVVDGDTP SRPFTMVDAH
TGEVLERWEG LNHAGTGTGP GGNAKTGQYE YGTDFGNLDV EVNGDTCTMN NANVKTVNLN
HGTSGNTAFS YTCPRNTVKE INGAYSPLND AHYFGGVVYD MYDQWYGTAP LSFQLTMRVH
YSNNYENAFW DGSAMTFGDG QSYFYPLVSL DVSAHEVSHG FTEQNSGLVY ANQSGGMNEA
FSDMAGEAAE FFMKGSNDWL VGADIFKGDG ALRYMADPTQ DGKSIGHIDD YYDGLDVHYS
SGVFNKAFYT LATTEGWDTR KAFEVFVIAN QLYWTPNSLM YEGACGVRNA ATDKGYSTAD
VDAAFAAVGI TPCEVPPPPP PPEAEVLQNG VAVTDISGAS GSKQYWTLDV PAGASNLVFN
LSGGSGDADL YVKFGANPTS TDYDCRPYAA GNNENCAISN VQQGTYWVML NGYSSYSGTT
LVGSFDGGST EPNEAPVASF TADYTGALYN FTSTATDSDG NVVAWSWDFG DGETATGSAA
SHQYLVSGSY TVTLTVTDDD GATASTSEVF NVEVPAAEMD LSITKVNVSR RGSARIGLEW
TGNSASYTVL RNGEAVGTTS ANSYVDRFTA TAGSSVTFQV CNSDGGCSDI ETVSF
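
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal, standard-library-only illustration and not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it assumes the sequence contains only the unambiguous bases A, C, G and T. The names `clean`, `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order (standard code;
# table 11 uses the same assignments and differs only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display row of the gene sequence in this record.
FIRST_ROW = ("ATGCGCAAGC AACAACTCTC ACTCTTGTTT "
             "ATCGCCGTTT CGGCAACCCT GGGTACATCA")
print(translate(FIRST_ROW))  # MRKQQLSLLFIAVSATLGTS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the whole protein and the GC content field to be compared against the record.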