Gene Sama_3284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3284 
Symbol 
ID4605531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3901708 
End bp3903018 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content53% 
IMG OID639782704 
Producthemolysin, putative 
Protein accessionYP_929156 
Protein GI119776416 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCT TCGAAAATAG TTTGATTATT CTGATGCTTA TCGGCATCAG TTGTTTTTTT 
TCCATGTCGG AAATCGCCCT GGCGGCCTCC CGTAAAATTC GGCTGCGACA GTTGGCCGAT
GAAGGCAATG AGCGGGCCCG AAAAGTACTT GAGCTGCAGG CTCATCCCGG CAGCTTCTTT
ACCGTGGTGC AAATAGGCCT CAATGCGGTG GCCATCATGG GCGGTATCGT GGGGGAGTCG
GCTTTTACCC CCCATATCAT GGCCTTGCTG GATGGTATAG TGCCAGCTAA GTGGCTGGGC
CAGTTCAGTT TTATCTTGTC TTTTATGCTG GTGACCAGCC TGTTTATTTT GCTGGCAGAT
CTAATGCCAA AGCGTATCGC CATGGCTATG CCAGAGCGTG TTGCTGTGTC ATTGGTGGGG
GCCATGGGCC TGTGCATTAC CGTGCTGCGC CCCCTGGTGT GGGTGTTCAA TGGCCTCGCC
AATGGTTTGT TCCGTATGCT GCGGATCCCC ACAGAGCGTA ACGATGCCAT CACAGAGGAT
GACATTTACG CGGTGATGGA TGCCGGTGCT GAAGCCGGTG TTATCGACAA GGGCGAACAG
CAGATGATGG AGAACGTGTT TGAAATGCAA AGCGTTCCTG TGACCTCAGC CATGACACCA
AGGGAAAGCC TGGTGTTCTT TCTGCAAAGC GACACCGAAG AAGACATCAA GCGTAAAATC
GCCGCCGACC CCCACAGCCA GTTTCTGCTC TGTGATGGCC AGCTGGATGC CATCAAGGGC
TATGTGGACT CCAAGGATTT GCTTATCAAG GTGATAAGCG GTCAGGCATT AAACCTGAAA
GACCCATCAC TGGTACAAAC CTGCCCCATT ATCCCGGATA CCCTGAGCCT GTCTGAGGCG
CTGGATTACT TCCGTAACAA CAGGGTCGAC TTTGCAGTTG TGCTCAATGA GTATGCGTTG
GTGCTGGGGG TGGTGACCTT TAACGATCTG CAAAGCGCTG TGATGGGGAC CTGGTCGCTG
GCCGAAGGTG AAGAACAAAT TGTGGCCCGG GATCCTTCGT CCTGGCTGGT GGATGGGGTA
ACGCCTATTA CCGATGTGAT GCGTGCCTTT GGCATCGACA GTTTTCCCCA GAGCCAGAAC
TATGAAACCA TCGCAGGTTT CATTATGTTT ATGCTGCGTA AAATCCCCCG CCGTACCGAC
TTTGTGGTCT ATGCCGGTTA CAAATTTGAA GTGGTCGATA TCGACTCATA CAAGGTGGAT
CAGCTGCTGG TGACCAAGGT GGAGTTGCCA CCCGGTGCTG AGGATCAGTA A
 
Protein sequence
MSFFENSLII LMLIGISCFF SMSEIALAAS RKIRLRQLAD EGNERARKVL ELQAHPGSFF 
TVVQIGLNAV AIMGGIVGES AFTPHIMALL DGIVPAKWLG QFSFILSFML VTSLFILLAD
LMPKRIAMAM PERVAVSLVG AMGLCITVLR PLVWVFNGLA NGLFRMLRIP TERNDAITED
DIYAVMDAGA EAGVIDKGEQ QMMENVFEMQ SVPVTSAMTP RESLVFFLQS DTEEDIKRKI
AADPHSQFLL CDGQLDAIKG YVDSKDLLIK VISGQALNLK DPSLVQTCPI IPDTLSLSEA
LDYFRNNRVD FAVVLNEYAL VLGVVTFNDL QSAVMGTWSL AEGEEQIVAR DPSSWLVDGV
TPITDVMRAF GIDSFPQSQN YETIAGFIMF MLRKIPRRTD FVVYAGYKFE VVDIDSYKVD
QLLVTKVELP PGAEDQ