Gene Ssed_3990 details


Gene Information

Locus tag: Ssed_3990
Symbol:
ID: 5610460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 4883183
End bp: 4884736
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 49%
IMG OID: 640934944
Product: sulfatase family protein
Protein accession: YP_001475722
Protein GI: 157377122
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
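
The reported lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length; the record does not state either convention, so both are assumptions.

```python
# Coordinates copied from the record above.
start_bp = 4883183
end_bp = 4884736

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is 3 bp. Assumption: the last codon is a stop codon,
# which is not counted as an amino acid residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1554 0 517
```

Under these assumptions the result agrees with the Gene Length (1554 bp) and Protein Length (517 aa) fields.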


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAACA AATTAACCAA GTCAGCGATC GCCTTAGCAG TTGTTGCCTC TAGCGGTGTG 
TCAGCAGCAT CAACTCAGCC TAACGTCGTT GCCATCATGT TAGATGATGT CACCACCATG
GATATCTCCG CATATCACCG CGGCTTAGGT GCAGTCAGTA CGCCTAATAT CGACCGAATT
GCCGAGCGCG GCATGATGGT GAGTGATTAC TATGCTCAAG GCAGTTCAAC TGCGGGTCGC
TCTGCCTTTA TCACTGGCCA ATATCCTATT CGAACCGGCT TAACCTCTGT TGGACAACCG
GGCTCGACCC GCGGCCTGCA AAAAGAGGAC CCGACCCTAG CGGAAATGCT CAAAGACAAA
GGCTATGCCA CCGTACATGT GGGTAAGAGT CACCTGGGTG ATAACAATGA CCACCTACCA
ACTGTGCATG GCTTTGATGA GTTCTACGGC TTCCTCTATC ACCTCAACGT GATGGAGATG
CACGAGCAGC CGGAGTTTCC TAAAGATCCC AACTTTAAGG GGCGTGGCCG CAACATGATC
CATACTGTCG CGACCGATAA GTTTGACGAC ACCGTCGACC CACGTTTTGG TGTGATCGGT
AAGCAAACCA TTAGCGACCA AGGTGAACTG GGAGCTAAGC GGATGCAGAC TGTCGATGGT
GAGTTCTTAG ATTTTGCTAT CAACTGGCTA GAAAAGCATG AAGCAACGAA TGACGACCAG
CCATATTTCA TGTGGTACAA CCCAACGCGT ATGCACCAGA AAACCCATGT GCGCCCTGAG
TATCAAGGTG CTAGCCAACA TAATACCTAC TATGACGGTT TAGTTGAGCT CGATGATCAA
ATTGGTGTGC TTCTCGACAA GCTAGAAGCG ACCGGAGAGA TAGACAACAC CATCATCCTA
TTTACCTCCG ACAACGGTGT GAATCTGGAC CATTGGCCTG ACTCCGGAGC GGCGTCTTTC
CGTGGCCAAA AAGGTACCAC TTGGGATGGC GGCTTCCGCG TACCAATGTT AGTGAGCTGG
CCAGCCAAGA TCCCTCAAGG AGAATATACC GATGGTTTGA TGTCTGCCGA AGATTGGGTG
CCAACGATTA TGGCTGCGGC GGGTGACGCG GATATCAAGC AAGACTTGCT AACAGGCAAG
AAGATTAACG ACGAAACCTA CAAGGTGCAT ATCGATGGTT ATAACCAACT GGATATGCTG
ACTGAGGGTG GCAAGAGTAA TCGACATGAG TTCTTCTTCT ATAACGAGAA TAGCTTAAAT
GCATTCCGTG TTGATGAGTG GAAAGTACAC CTTAAAACCA AAACCGAATG GATTGCCCCT
GCAGATGAGT GGCCATTGGG CATGATCCTC AATATTAAAG CGGATCCGTT TGAGCGCTCT
CCTGATACCC GCGGTTGGTT CCTCTGGATG AAAGAGAAGA CTTGGGTACT GCCAAAGCTA
CTTAAAGCAG TTGGTAAACA TCAGCAGTCA CTTAAAGCCT TCCCTCCTCG TTTAACAAAC
GGCGGCATTG GTATGAACGA CAAGTCTGTC GTTGAAGAAA AGAGCAACAA ATAA
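
The gene sequence above is laid out in blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how a GC percentage such as the GC content field could be computed; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of the source database, and the demonstration uses only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGGTTAACA AATTAACCAA GTCAGCGATC GCCTTAGCAG TTGTTGCCTC TAGCGGTGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the reported 49%.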
 
Protein sequence
MVNKLTKSAI ALAVVASSGV SAASTQPNVV AIMLDDVTTM DISAYHRGLG AVSTPNIDRI 
AERGMMVSDY YAQGSSTAGR SAFITGQYPI RTGLTSVGQP GSTRGLQKED PTLAEMLKDK
GYATVHVGKS HLGDNNDHLP TVHGFDEFYG FLYHLNVMEM HEQPEFPKDP NFKGRGRNMI
HTVATDKFDD TVDPRFGVIG KQTISDQGEL GAKRMQTVDG EFLDFAINWL EKHEATNDDQ
PYFMWYNPTR MHQKTHVRPE YQGASQHNTY YDGLVELDDQ IGVLLDKLEA TGEIDNTIIL
FTSDNGVNLD HWPDSGAASF RGQKGTTWDG GFRVPMLVSW PAKIPQGEYT DGLMSAEDWV
PTIMAAAGDA DIKQDLLTGK KINDETYKVH IDGYNQLDML TEGGKSNRHE FFFYNENSLN
AFRVDEWKVH LKTKTEWIAP ADEWPLGMIL NIKADPFERS PDTRGWFLWM KEKTWVLPKL
LKAVGKHQQS LKAFPPRLTN GGIGMNDKSV VEEKSNK
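
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a plain-Python illustration, not the database's own method. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acid for each codon in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_codons = "ATGGTTAACA AATTAACCAA GTCAGCGATC"
print(translate(first_codons))  # MVNKLTKSAI
```

The output matches the first ten residues of the protein sequence, and the final codon of the gene (TAA) is a stop codon, which is why the protein ends at the preceding lysine (K).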