Gene Sama_3624 details

Gene Information

Locus tag: Sama_3624
Symbol:
ID: 4605871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 4263188
End bp: 4266004
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 56%
IMG OID: 639783045
Product: protease, putative
Protein accession: YP_929496
Protein GI: 119776756
COG category: [S] Function unknown
COG ID: [COG4412] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03296] M6 family metalloprotease domain
            [TIGR03501] gammaproteobacterial enzyme C-terminal transmembrane domain
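
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes inclusive, 1-based coordinates and a CDS ending in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4263188, 4266004)
print(length)                     # 2817
print(protein_length_aa(length))  # 938
```

Both values match the Gene Length and Protein Length fields of the record.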


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00224012
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAAAC GATTTAATAC GCCCAAGCTC TGTGCCTTGG GGATTCTACT GGCGCTGGCC 
GGTGGTCAGG CGCTGGCGGC ACCTGCCATG GGTTCACCGG CCGACAGTGG CGTTATCAAT
AAAGAGCGCG TGCTTTACTG GCTTATCAAG CGTGGTGAAA TTGCCGAAGA TGCCAGTGAC
GAAACCAAAG CCGCTGCCGT GGCTGCGTTC GTGGCCCGGG CCAAATCGTC CACACCCGCG
ATGGAGCAAA TCAAAGCTCA GGAGCAAAGC CTTGCCAGTA AGGCTCGCTA TGCTCAAAAG
CTGCGCAGCC AATACCAGCA GGTGGATGAC GCCGATGTCA CCAAAACCGT TAAGGTACTT
GGCGTGTTGG TGGATTTTCC GGATCTGCCC TACAACAATA ACCGCCTGAC AGCGGGTGAT
ACTGACATGT ATTACTCCAG CTATCCGGCA ATTCATTATT CCCAGATGCT GTTTTCTGCT
TCGGGCTTTA CCGGACCGTC CAACCAGACG TTGATTTCCG GGTTTCAATA TTTCCAGCAG
GCTTCGGGCA ACACCTTTTT CTTCACCGGC GCAGTCCGTG ATTGGGTGCG GGCCGACAAC
AACGCCGCCT ACTATGGTGG TAACGACCCC AGCAACGAAG ACAACGACAA GGCTGTGCCC
GAGTTGGTGC TGGAGGCTGT CACCAAGGCA GTGGCTGGCA TGAGTGCCTC CGAGCTTGCC
AGCTATGACG TGGAAGACCC ATACGATATC AACAGTAACG GCAACCTCAA CGAACCAGAC
GGCATTATCG ACCACATCAT GCTGTTCCAT TCCAGCATTG GTGAAGAAGC CGGTGGTGGC
GTGCTCGGCG CCGATGCTAT CTGGTCACAC AGATTCTTTA TCCAGAGTGA TCCCTCTGCG
TACGGCAAGG CCATTCCCGG CACCAACCTG AGAGCCTATG GCTACACGGT GCAGCCCATA
GATGCCGCTG CCGGGGTGTG CACCCACGAA TTTGGCCATG ATCTTGGTCT GCCCGATGAA
TATGACTACG ACCCGGACGG TGATGGCTCG CCTGTGGGTA GCTGGTCGCT GATGTCCGGT
GGTAGTTGGA CTGGTACTGT GGCGGGTTCG GAGCCAGTGG GTTTCAGTCC TTATGCGCGC
TCATTCCTGC AGCGCGAATA CAAGGGTAAA TGGGTTAAGG AACAGGAAAT CAGCCTCGAT
AGCCTGAATT CTGCCGGCAC CGATTTCAAT CTGGTGGAAG GGGTCAATGC CAACGGGGTT
AACCAGCTGT CACTTAAGGT GCCCAGCGAG CCTCTGCCGT TCAAGGCGCC CTATGCCGGA
GACTATCAGT ACTACTCCAA TAAGGGGCAT AAGCTGAACA ATGCCATGTC CTTCAACGTG
ACCCTGCCTG TGGTGGATTC TCTCACGTTG AAAATGCGCG CCCACTGGGA TATCGAGTTC
GACTTCGACT ATATGCAGGT GCAGGTGGAT GGTACGCCCA TTCCGGGCAA CCACACAAAG
GCCACCAGTT ATTACGAGGC AGGACGCAAT GTTATTACCG GTAGCTCAGC CAATATTGCC
GGAAACGAAG GTCCTGATCA CTGGGTTGAG CTGACATACG ATCTCAGCGC TTACAAGGGC
ATGAGCAAGC AGATCAGCAT AGTGTACAAA ACCGACGAGT TTGAAGGGGG TTACGGTATC
GCCATCGATA ACCTGTCTAT CGTCTCCGGC AGTGAAACCC TTTACAGCGA TGATGCCGAA
ACCAGCGACA AGATGATGTT GGCAGGCTTT GTTCGCACCA CGGATACCCG TCCGGGCAAT
GACCGCCGCT ATCTGATCCA ACTGCGCAGC CAAAACGGTA TCGACAAGGG GCTGGCAAGC
CACGGCTACA GTCCCGGTGT GGTGTTATGG TTTGAAAACT TCGATTACAG CGATAACAAC
ACCACTGAGC ACCCCGGCTA TGGCCTGATT GGGGTAGTGG ATGCGGATCA GAACCTGATT
GGTACCCGTG GCACAGATGT GCAGATTCGC GATGCGGCCT TCAGTATCCG TCAGCAAACA
GCGTACGCCG GTGATACCAA CCTCGGCCCT GTGAGTTTGT TCGATGATTC ACTGGATTAC
AGTGCGCCGC TCAAGCCCCA GGCGGGGATG ATATTGCCTG AGCTCGGCCT GACCATGGAA
GTGATCCAGC TCGCCGCCAA TAACAGCACG GCAACGGTAC GACTGAAGAA ACACGACGGC
AGTGTGCCTG AGCCAACGGA CATGAACGTC AGCGTGGGGG TCACCTTCTC TGGTGCTACG
GCAAACTTTA CCAGCACTGT CACCGGTGGC GACGGTAATT ACAGCTACTT CTGGGACTTT
GGTAATGGCG CCAGTTCTAC TCAGGCCAAC CCGAGTTACA CCTACGGTAA TACCGGAACC
TATGACATCA GCCTGACGGT GACCGATGGC AAAGGTGTTG GGGTGACTGT GACTCAGCAA
TTGTCGGTAA CTGTGCCCGT CACGGCGAGT TTCAGTCAGA GCGTCAGTCA GTTAACCGTC
AGTTTTACCA ACAGCAGCAG TGGTGGTGAT GGAAACCTTA GCTACACCTG GAGCTTCGGT
GATGGCCAAA GCAGCACAGC TGCCTCGCCA AGCCATACCT ATGCAGCGGC GGGCAGCTAC
ACTGTGACCC TCACTGTGAC CGATGGCAAG GGGGTTTCCT CTACCCGCAG CGCCAGCGTA
ACCGTTAGCG CACCTGCAGT AACACCTTCA GATGGTGGTT CGGGTGGTGG CAGCCTGGGA
TGGTTGGCCA CCCTGGCGCT GGTGCTTGCC GGTATGGCAA GACGCAGACA AGCCTGA
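
The gene sequence is printed in space-separated blocks of ten bases, so it has to be normalized before the GC content or the reading frame can be checked. The sketch below shows one way to do that; the helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_in_frame_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First line of the listing above, used only as a small sample.
first_line = normalize(
    "ATGGTCAAAC GATTTAATAC GCCCAAGCTC TGTGCCTTGG GGATTCTACT GGCGCTGGCC"
)
print(len(first_line))
print(round(gc_fraction(first_line), 2))
```

Passing the full listing through `normalize` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.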
 
Protein sequence
MVKRFNTPKL CALGILLALA GGQALAAPAM GSPADSGVIN KERVLYWLIK RGEIAEDASD 
ETKAAAVAAF VARAKSSTPA MEQIKAQEQS LASKARYAQK LRSQYQQVDD ADVTKTVKVL
GVLVDFPDLP YNNNRLTAGD TDMYYSSYPA IHYSQMLFSA SGFTGPSNQT LISGFQYFQQ
ASGNTFFFTG AVRDWVRADN NAAYYGGNDP SNEDNDKAVP ELVLEAVTKA VAGMSASELA
SYDVEDPYDI NSNGNLNEPD GIIDHIMLFH SSIGEEAGGG VLGADAIWSH RFFIQSDPSA
YGKAIPGTNL RAYGYTVQPI DAAAGVCTHE FGHDLGLPDE YDYDPDGDGS PVGSWSLMSG
GSWTGTVAGS EPVGFSPYAR SFLQREYKGK WVKEQEISLD SLNSAGTDFN LVEGVNANGV
NQLSLKVPSE PLPFKAPYAG DYQYYSNKGH KLNNAMSFNV TLPVVDSLTL KMRAHWDIEF
DFDYMQVQVD GTPIPGNHTK ATSYYEAGRN VITGSSANIA GNEGPDHWVE LTYDLSAYKG
MSKQISIVYK TDEFEGGYGI AIDNLSIVSG SETLYSDDAE TSDKMMLAGF VRTTDTRPGN
DRRYLIQLRS QNGIDKGLAS HGYSPGVVLW FENFDYSDNN TTEHPGYGLI GVVDADQNLI
GTRGTDVQIR DAAFSIRQQT AYAGDTNLGP VSLFDDSLDY SAPLKPQAGM ILPELGLTME
VIQLAANNST ATVRLKKHDG SVPEPTDMNV SVGVTFSGAT ANFTSTVTGG DGNYSYFWDF
GNGASSTQAN PSYTYGNTGT YDISLTVTDG KGVGVTVTQQ LSVTVPVTAS FSQSVSQLTV
SFTNSSSGGD GNLSYTWSFG DGQSSTAASP SHTYAAAGSY TVTLTVTDGK GVSSTRSASV
TVSAPAVTPS DGGSGGGSLG WLATLALVLA GMARRRQA