Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1481 |
Symbol | |
ID | 4243056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2251946 |
End bp | 2253229 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106633 |
Product | sulfotransferase |
Protein accession | YP_721243 |
Protein GI | 113475182 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.131097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAACA GGTACAATGA AACTGAGATT CCTTTGCCAC AAATAAAAAT GACTCAAAAT TCCTTTATTG AAATCTTACA AGTCCAAAAA TTGTCTCAAG AATCTGAATC ATTAATCGGA TTTAATATAG ACTCGCTCCA GAAAGGTAAA AAAATAGATA ATTATTCAAT TTCAATTGTA GGCTGGGTAT TAGGTAAGGG ATCACCTGCA GTTGCAGTAG AATTTCTTAA TGGAGGAAAA GTATTACAGA AAGTTCCGGT CAGTAAGTCC CGTCCGGGAG TAGAAAAACG TTTCCCAAAA GTGAGCGAAG CCAAAAGTAG CGGTTTTGCT GCAGCAGTAG GGGTATTGGG ATTGCCTCCG GTAGCTAACT TAAAATTGCG GTGTTTGTTG GCTGACGGCA CTATTGTTCC CTTAGCAGAT ATAAAATTTC GCCATGAACC CCTGGATTCT GGTTATCAAC CAAGATTGCA ACCTTTAGTA TTAAATTCTA CGGGGCGATC GGGAAGTACT TGGTTTATGC GGTTGCTTTC CCAACATCCA GCAATTGTAG CTTGTCCAAT ATATGCTTAT GAAACCAGAG TATGGCCTTT TTGGATGCAA TTGCTGCAAT TTTTGTCTCA AAGAGCTAAT CCGATAGAAA CTCCTTTTCA AGCAGATTAT CCACCAAAAG CAGCAGCTTT CTGTCAGGAA ACTTTGGATG CTTTTTACGA ACATATAGCG GAAGCTCAAG GTAAGATAAT TTCTTCTTCT GAATTAACTT ATTTTGTAGA AAAGAATGCT TTTAATCCTA ATCTTGACTT ATTAACAGAA ATTTATCCCC AGGCTAAAGA AATAATTCTG GTTCGGGATT TTCGAGATGT AGCTTCTTCA ATATTAGCCT TTAATAAGAA ACGAGGTAAC GATGGTTTTA GTCGCAATAA GTTTAAAAGT GATGAGGAAT ATATTAAAGG AGTTATGAAA AATGCGTCAT CTGGTTTGCG GGAGCGGTGG AAGAAGCGCA GACAAGAAGC ATATTTAGTC CATTACGAAG ATTTAATTCT CTCCCCAGAA AAAACTTTAA AGGGAGCTCT AGAATATTTA AACTTAGATA ACTCTCAGGA GGCGATCGCT AATATGCTAG AAAAAGCATC GGAAAATACT GCCTATACCC AATGGCATCA AACTAGCTCT AACGCTAAAG ACTCTATCGG AAGGTGGCGG CAAGATCTAG AACCTTCCCT GCAAAAATTA TGCAATCGGG TTTTAGCTGA TGCTTTGCGA GAATTTGGAT ATTCTGTTAA CTAA
|
Protein sequence | MRNRYNETEI PLPQIKMTQN SFIEILQVQK LSQESESLIG FNIDSLQKGK KIDNYSISIV GWVLGKGSPA VAVEFLNGGK VLQKVPVSKS RPGVEKRFPK VSEAKSSGFA AAVGVLGLPP VANLKLRCLL ADGTIVPLAD IKFRHEPLDS GYQPRLQPLV LNSTGRSGST WFMRLLSQHP AIVACPIYAY ETRVWPFWMQ LLQFLSQRAN PIETPFQADY PPKAAAFCQE TLDAFYEHIA EAQGKIISSS ELTYFVEKNA FNPNLDLLTE IYPQAKEIIL VRDFRDVASS ILAFNKKRGN DGFSRNKFKS DEEYIKGVMK NASSGLRERW KKRRQEAYLV HYEDLILSPE KTLKGALEYL NLDNSQEAIA NMLEKASENT AYTQWHQTSS NAKDSIGRWR QDLEPSLQKL CNRVLADALR EFGYSVN
|
| |