Gene Information |
Locus tag | Spea_3275 |
Symbol | |
ID | 5663663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | - |
Start bp | 4017075 |
End bp | 4019033 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641237933 |
Product | sulfatase |
Protein accession | YP_001503125 |
Protein GI | 157963091 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
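
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4017075, 4019033)
print(length_bp, protein_length(length_bp))  # prints: 1959 652
```

Both results match the Gene Length and Protein Length fields reported in the record.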
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGAG GCTATACTAG CAACCTCAGA CATTTGGTTA GCTTGTTAGT GTTAACGGGG TTGAGTTGTT ACTTCTATTA TTTCATGGAG TGGTTATTCT TCCTCTCTCG ACCCTCAATG CTTAGTTATT TGAGCTTCAC TGACAACTTG ACCTTACTCT TCAGCGCTCC TTTACCTCTC ATTACTATTA CTCTACCTGC AGTAGGCATA TTGGGATTAT TGTCGACTCT CTTAGCTAGC ATGAAAGCAG TGCCTAAGGC TGTTTCTAAC TTGGCTCGCT ATGCTATTCC TGCAATAGTT TTGACGGGGA CGGCATTTTT AATGCTAGAT AATTTCTCAT ATACCCTGTT TGGTATCGCC TCACACTCAG CTTCAACCTT TGCTTTTCAA GCTTACTATT GGATTATCAT TTTCCTCTTT TTCATCTACT TCATAGACAA GCTCGCCAAG GCTAATTCCA CTGACGGGGT CATCGGCAAG TATTGCGGGT CTTTGCTTAA GTTGATGTTG ATATTACTCG GTCTCTCATC GGTCTCCTTA GTGGTATCGT ACAATCAACC ACTTTCTACA GAGCTAGGTG TAAGTACTGC TGCCGTGAGT GCTACACCAT TGAGGGATAA GTTACCCAAT ATTATTTTCT TAGGTGCTGA CGGGGTTAAT AGCAGCAACA TGTCTATCTA TGGCTATCAA CGGGTCACCA CACCATTTAT GGACAGTATC GCCCATGAGA GCCTTATCTA TAGAAATCAT TGGACTAATT CATCTAAAAC GACAGGCTCA ATTGGAGCGC TATTGACTGG CAAGTATCCA ACTCGGACCA AGGTTATCTT CAGGCCAGAT ACCTTTAAAG GAGAAGATAT GTTTCAGCAT CTTCCCGGAA TATTGAGAGG GCTTGGATAT TATAATATTG ATATTACCTT GAGGCATTAC ATCGACTCGG AAGATCTGAA GATGCGTAAT GCCTTCGATT ACGCAAATCA TAGGCAATTG AATCAACAAG GCTCACAACT AAAGCATTTC TTTCTACTGC GTTGGCCCAC TATGGTGCAG TTTATCGAAG AAAATGGGCT GCGGCTCTAT CAACGATTGG CGCACTTGAG TGGTTATACC TCTATGGTTA ATCCACGTAA GCAGATCCAT CAAGGTGCTG ATGCTGCGCC CCATCTTTCG GATATTGGCC GTATAAAGCA ATTGAAGGAG CAAATTCTAC AGGCTCCCAA ACCATTTTTT GCAAACGTAC ATTTGCTCGG CCCACACGGG AGCAAATTTG ACTATCAAGA AGCTATTTTT ACCGAAACAA AGCAGCAAGA TGAGCATTGG ATGGTTGATC ATTATGACAA CGCAATCTAT CAATGGGATG CATATAGTCG AGAAATATAC CAACTGTTAG ATGAGTTAGG TGAGCTGGAT AATACCCTGT TGGTATTCAG CAGCGATCAC GGCAAAGGCC ACAGTGTAAA TGAAACTCTG CCCTTGATCA TACGTTACCC GAATAAAGAG CATACAGGCA CAATAACTCA GGCATCTCAG AGAGTGGATA TTGCGCCAAC CGTATTATCC TATTTGGGAG TCACGCCGCC GCAATGGATG GACGGTCATA GTCTATTATC TCGAGGTAAC GACTTCTATC CAATATTCAT CGTCACCTCT TCGATGCAAC AGTTAACCAA TGCCGGGGAC TGGAAGGTTG CAGCTAATTT AAAGCCGCCT TTTTACTCTT TGGGCACCAT ATCTATGGCG TATTGTGGCA TTTTATATTC CATGGACATT AACGATATTC ATCAACCTTT GCTTTCCCAG 
CAAAGAGTTC ACCCTAAAGC CGCTACATGT CCAGATGTCG ATTTAGAGCC CATGTTAGCG TATGGGATGA TTGTCACCCA CCTTAAAGAG ATGGGCTACA ATACCGATCA ACTTAACGTC GAGTTGCGCT TAAGGCAGTA TAGTGTGCGG ACTGAGTGA
|
Protein sequence | MMRGYTSNLR HLVSLLVLTG LSCYFYYFME WLFFLSRPSM LSYLSFTDNL TLLFSAPLPL ITITLPAVGI LGLLSTLLAS MKAVPKAVSN LARYAIPAIV LTGTAFLMLD NFSYTLFGIA SHSASTFAFQ AYYWIIIFLF FIYFIDKLAK ANSTDGVIGK YCGSLLKLML ILLGLSSVSL VVSYNQPLST ELGVSTAAVS ATPLRDKLPN IIFLGADGVN SSNMSIYGYQ RVTTPFMDSI AHESLIYRNH WTNSSKTTGS IGALLTGKYP TRTKVIFRPD TFKGEDMFQH LPGILRGLGY YNIDITLRHY IDSEDLKMRN AFDYANHRQL NQQGSQLKHF FLLRWPTMVQ FIEENGLRLY QRLAHLSGYT SMVNPRKQIH QGADAAPHLS DIGRIKQLKE QILQAPKPFF ANVHLLGPHG SKFDYQEAIF TETKQQDEHW MVDHYDNAIY QWDAYSREIY QLLDELGELD NTLLVFSSDH GKGHSVNETL PLIIRYPNKE HTGTITQASQ RVDIAPTVLS YLGVTPPQWM DGHSLLSRGN DFYPIFIVTS SMQQLTNAGD WKVAANLKPP FYSLGTISMA YCGILYSMDI NDIHQPLLSQ QRVHPKAATC PDVDLEPMLA YGMIVTHLKE MGYNTDQLNV ELRLRQYSVR TE
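
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch under stated assumptions: the helper names are hypothetical, only the first two and the final display blocks of the gene sequence are used as sample input, and the stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Sample input: the first two and the final display blocks of the gene sequence.
head = clean_sequence("ATGATGCGAG GCTATACTAG")
tail = clean_sequence("ACTGAGTGA")
print(head.startswith("ATG"), ends_with_stop(tail))  # prints: True True
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the GC content field of the record.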