Gene Sama_3507 details

Gene Information

Locus tag: Sama_3507
Symbol:
ID: 4605754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 4139729
End bp: 4141669
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 56%
IMG OID: 639782928
Product: sulfatase
Protein accession: YP_929379
Protein GI: 119776639
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
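
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4139729, 4141669)
print(length_bp)                  # 1941
print(protein_length(length_bp))  # 646
```

Both values match the Gene Length and Protein Length fields of the record.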


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGCA CAAAACCGCC ACAAAGCATT TGGCGGATGA TAGCCATTTT TGGTCTGTTG 
GCCATGGTTG CCCTGACGGC AAGCCGTATT GGACTTGGTC TGTGGCAGAG TGACAGAGTG
GCAGCCGCTG AGGGATGGAC CCATATGCTG TTGCAAGGCG CGCGGGTCGA TATTGCCACC
CTGTGCTGGC TGTGGGGCAT TGCCGCCCTG GGCACAGTGA TTTTTGCCGG CAATCACGCC
ATAGGCCGTG TGTGGCTGGT GCTGCTCAGA GGCTGGCTGG TGCTTGGCCT GTGGCTGATG
CTGTTTCTTG AACTTTCAAC GCCTTCTTTC ATCGAGGAAT ACGGCATTCG CCCCAATCGC
CTCTATGTGG AATACCTGAT TTACCCCAAA GAAGTCCTGT CCATGCTCTG GAGTGGCAGA
CCGATGGAGT TGGTATTTGG CCTGCTGGCG TCCACCGGTA TTCTCTGGGG GGGGTGGCGT
TTATCCGGGC GTCTTTCAAC CCATCTGGTT TATCCGCGCT GGTACTGGCG ACCCGTGTTT
GCACTCCTGG TGATTATTGT GACCCTGATG GGAGCACGCT CGACCTTGGG GCACAGGCCG
CTCAATCCGG CCATGGTGGC CTTCTGCGAC GATCCTCTGG TGAATTCGCT CACGGTAAAT
TCGGCCTATT CACTGGTGTT TGCCATGACT CAGATGGGTC AGGAAGAAAA TGCCGCCAAG
ATGTATGGTC AATTGACGGA CGAGGAAATC ATTGCTGCTA TCCGCGAGGA CAGCGGCAGG
TCTGCTGAGC GTTTTGTTTC AGACCTATTG CCCAGTCTTT CGGATAACCC AGCGTCCTAT
CAGGGCAAGC CCCGCAATCT GGTGATTATT CTGCAGGAAA GCCTGGGTGC CCGTTTTGTC
GGTAGCCTCG GTGGTTTGCC ATTGACGCCC AATCTCGATG CGCTGTCTCA GGAAGGTTGG
TACTTCGATA ATCTCTTCGC CACAGGCACC CGCTCGGTAC GCGGAATCGA GGCTGTGACC
ACGGGCTTTA CCCCCACTCC CGCCAGGGCC GTAGTCAAAC TGGGCAAGAG CCAACAGGGC
TTTTTCACCA TTGCCGATTT CTTACGTCGC CAGGGTTACG ACACTTCGTT TATCTATGGC
GGTGAAAGCC ATTTTGACAA TATGCGCAGT TTCTTCCTGG GCAATGGCTT TGGCCGTATC
GTCGACCAGG ATGACTACCA GAATCCGGCC TTTGTGGGCA GTTGGGGGGT ATCAGATGAA
GACCTGATGC GCCGCGCCGA CAGTGAGTTT AAGCGCCTGC ATCAGCAGGG CACGCCCTTC
TTCAGTCTGG TGTTCAGCTC CAGTAACCAC GATCCCTTCG AGTTTCCCGA TGATCGCATC
GAACTCTATG AGCAGCCGAA GCAAACCCGT AATAACGCCG CCAAGTATGC GGATTTCGCT
ATAGGTGAGT TCTTTAAGCT CGCCAAAAAT GCCGACTACT GGCAGGACAC GGTATTTTTG
GTGGTGGCGG ATCACGACAG TCGTGTTGGC GGTGCCAATC TGGTGCCGGT AAATCGCTTT
CGTATTCCCG GCCTCATCAT TGCCGATGGG GTGGCGCCCA AGCGCGACCA GCGGGTGGTG
AGCCAAATCG ATTTGGCGCC GACCTTGTTA TCGCTGATAG GCGTTTCTGG CCGCTACCCC
ATGCTGGGGA AAGATTTGAC CCGCACACCC GATAATTGGC CGGGGCGGGC GCTGATGCAG
TACGACAAGA ACTTTGCCTA TATGCGTGGT CAGGATTTGG TGATCCTGCA GCCTGAGCGC
GACCCCGAAG GTTTTACCTA TCAACCGGAA GATGGCAGCT TGCTGCCAAG TCCACAGCCC
AAGAGTATGC AAAAAACCGC CTTGGCCTGG GCTTTGTGGG GAAGTCTTGC CTATCAGAAA
GGGTTATTCA GGCAGGAGTG A
 
Protein sequence
MRSTKPPQSI WRMIAIFGLL AMVALTASRI GLGLWQSDRV AAAEGWTHML LQGARVDIAT 
LCWLWGIAAL GTVIFAGNHA IGRVWLVLLR GWLVLGLWLM LFLELSTPSF IEEYGIRPNR
LYVEYLIYPK EVLSMLWSGR PMELVFGLLA STGILWGGWR LSGRLSTHLV YPRWYWRPVF
ALLVIIVTLM GARSTLGHRP LNPAMVAFCD DPLVNSLTVN SAYSLVFAMT QMGQEENAAK
MYGQLTDEEI IAAIREDSGR SAERFVSDLL PSLSDNPASY QGKPRNLVII LQESLGARFV
GSLGGLPLTP NLDALSQEGW YFDNLFATGT RSVRGIEAVT TGFTPTPARA VVKLGKSQQG
FFTIADFLRR QGYDTSFIYG GESHFDNMRS FFLGNGFGRI VDQDDYQNPA FVGSWGVSDE
DLMRRADSEF KRLHQQGTPF FSLVFSSSNH DPFEFPDDRI ELYEQPKQTR NNAAKYADFA
IGEFFKLAKN ADYWQDTVFL VVADHDSRVG GANLVPVNRF RIPGLIIADG VAPKRDQRVV
SQIDLAPTLL SLIGVSGRYP MLGKDLTRTP DNWPGRALMQ YDKNFAYMRG QDLVILQPER
DPEGFTYQPE DGSLLPSPQP KSMQKTALAW ALWGSLAYQK GLFRQE
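
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to do both in plain Python, under stated assumptions: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the usual TCAG ordering, and only the amino acid assignments of table 11 are used (its alternative start codons are ignored). Only the first row of the gene sequence is translated here as a spot check.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments of translation
# table 11 are used here (alternative start codons are not handled).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGAGAAGCA CAAAACCGCC ACAAAGCATT TGGCGGATGA TAGCCATTTT TGGTCTGTTG"
print(translate(first_row))  # MRSTKPPQSIWRMIAIFGLL
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.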