Gene Information |
Locus tag | Ssed_3990 |
Symbol | |
ID | 5610460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 4883183 |
End bp | 4884736 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640934944 |
Product | sulfatase family protein |
Protein accession | YP_001475722 |
Protein GI | 157377122 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
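The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they are consistent with the values in this record. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4883183
END_BP = 4884736


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1554, matches "Gene Length"
print(protein_length(length_bp))  # 517, matches "Protein Length"
```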
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAACA AATTAACCAA GTCAGCGATC GCCTTAGCAG TTGTTGCCTC TAGCGGTGTG TCAGCAGCAT CAACTCAGCC TAACGTCGTT GCCATCATGT TAGATGATGT CACCACCATG GATATCTCCG CATATCACCG CGGCTTAGGT GCAGTCAGTA CGCCTAATAT CGACCGAATT GCCGAGCGCG GCATGATGGT GAGTGATTAC TATGCTCAAG GCAGTTCAAC TGCGGGTCGC TCTGCCTTTA TCACTGGCCA ATATCCTATT CGAACCGGCT TAACCTCTGT TGGACAACCG GGCTCGACCC GCGGCCTGCA AAAAGAGGAC CCGACCCTAG CGGAAATGCT CAAAGACAAA GGCTATGCCA CCGTACATGT GGGTAAGAGT CACCTGGGTG ATAACAATGA CCACCTACCA ACTGTGCATG GCTTTGATGA GTTCTACGGC TTCCTCTATC ACCTCAACGT GATGGAGATG CACGAGCAGC CGGAGTTTCC TAAAGATCCC AACTTTAAGG GGCGTGGCCG CAACATGATC CATACTGTCG CGACCGATAA GTTTGACGAC ACCGTCGACC CACGTTTTGG TGTGATCGGT AAGCAAACCA TTAGCGACCA AGGTGAACTG GGAGCTAAGC GGATGCAGAC TGTCGATGGT GAGTTCTTAG ATTTTGCTAT CAACTGGCTA GAAAAGCATG AAGCAACGAA TGACGACCAG CCATATTTCA TGTGGTACAA CCCAACGCGT ATGCACCAGA AAACCCATGT GCGCCCTGAG TATCAAGGTG CTAGCCAACA TAATACCTAC TATGACGGTT TAGTTGAGCT CGATGATCAA ATTGGTGTGC TTCTCGACAA GCTAGAAGCG ACCGGAGAGA TAGACAACAC CATCATCCTA TTTACCTCCG ACAACGGTGT GAATCTGGAC CATTGGCCTG ACTCCGGAGC GGCGTCTTTC CGTGGCCAAA AAGGTACCAC TTGGGATGGC GGCTTCCGCG TACCAATGTT AGTGAGCTGG CCAGCCAAGA TCCCTCAAGG AGAATATACC GATGGTTTGA TGTCTGCCGA AGATTGGGTG CCAACGATTA TGGCTGCGGC GGGTGACGCG GATATCAAGC AAGACTTGCT AACAGGCAAG AAGATTAACG ACGAAACCTA CAAGGTGCAT ATCGATGGTT ATAACCAACT GGATATGCTG ACTGAGGGTG GCAAGAGTAA TCGACATGAG TTCTTCTTCT ATAACGAGAA TAGCTTAAAT GCATTCCGTG TTGATGAGTG GAAAGTACAC CTTAAAACCA AAACCGAATG GATTGCCCCT GCAGATGAGT GGCCATTGGG CATGATCCTC AATATTAAAG CGGATCCGTT TGAGCGCTCT CCTGATACCC GCGGTTGGTT CCTCTGGATG AAAGAGAAGA CTTGGGTACT GCCAAAGCTA CTTAAAGCAG TTGGTAAACA TCAGCAGTCA CTTAAAGCCT TCCCTCCTCG TTTAACAAAC GGCGGCATTG GTATGAACGA CAAGTCTGTC GTTGAAGAAA AGAGCAACAA ATAA
|
Protein sequence | MVNKLTKSAI ALAVVASSGV SAASTQPNVV AIMLDDVTTM DISAYHRGLG AVSTPNIDRI AERGMMVSDY YAQGSSTAGR SAFITGQYPI RTGLTSVGQP GSTRGLQKED PTLAEMLKDK GYATVHVGKS HLGDNNDHLP TVHGFDEFYG FLYHLNVMEM HEQPEFPKDP NFKGRGRNMI HTVATDKFDD TVDPRFGVIG KQTISDQGEL GAKRMQTVDG EFLDFAINWL EKHEATNDDQ PYFMWYNPTR MHQKTHVRPE YQGASQHNTY YDGLVELDDQ IGVLLDKLEA TGEIDNTIIL FTSDNGVNLD HWPDSGAASF RGQKGTTWDG GFRVPMLVSW PAKIPQGEYT DGLMSAEDWV PTIMAAAGDA DIKQDLLTGK KINDETYKVH IDGYNQLDML TEGGKSNRHE FFFYNENSLN AFRVDEWKVH LKTKTEWIAP ADEWPLGMIL NIKADPFERS PDTRGWFLWM KEKTWVLPKL LKAVGKHQQS LKAFPPRLTN GGIGMNDKSV VEEKSNK
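The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not a definitive tool: the helper names are hypothetical, the start-codon check accepts only ATG (translation table 11 also permits alternative start codons), and the stop codons are the standard TAA, TAG and TGA. Applied to the full gene sequence, `gc_fraction` could be compared with the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for display grouping and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# The first two display groups of the gene sequence, used as a small example.
fragment = clean_sequence("ATGGTTAACA AATTAACCAA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.25
```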
|