Gene Tery_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1481 
Symbol 
ID4243056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2251946 
End bp2253229 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content38% 
IMG OID638106633 
Productsulfotransferase 
Protein accessionYP_721243 
Protein GI113475182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAACA GGTACAATGA AACTGAGATT CCTTTGCCAC AAATAAAAAT GACTCAAAAT 
TCCTTTATTG AAATCTTACA AGTCCAAAAA TTGTCTCAAG AATCTGAATC ATTAATCGGA
TTTAATATAG ACTCGCTCCA GAAAGGTAAA AAAATAGATA ATTATTCAAT TTCAATTGTA
GGCTGGGTAT TAGGTAAGGG ATCACCTGCA GTTGCAGTAG AATTTCTTAA TGGAGGAAAA
GTATTACAGA AAGTTCCGGT CAGTAAGTCC CGTCCGGGAG TAGAAAAACG TTTCCCAAAA
GTGAGCGAAG CCAAAAGTAG CGGTTTTGCT GCAGCAGTAG GGGTATTGGG ATTGCCTCCG
GTAGCTAACT TAAAATTGCG GTGTTTGTTG GCTGACGGCA CTATTGTTCC CTTAGCAGAT
ATAAAATTTC GCCATGAACC CCTGGATTCT GGTTATCAAC CAAGATTGCA ACCTTTAGTA
TTAAATTCTA CGGGGCGATC GGGAAGTACT TGGTTTATGC GGTTGCTTTC CCAACATCCA
GCAATTGTAG CTTGTCCAAT ATATGCTTAT GAAACCAGAG TATGGCCTTT TTGGATGCAA
TTGCTGCAAT TTTTGTCTCA AAGAGCTAAT CCGATAGAAA CTCCTTTTCA AGCAGATTAT
CCACCAAAAG CAGCAGCTTT CTGTCAGGAA ACTTTGGATG CTTTTTACGA ACATATAGCG
GAAGCTCAAG GTAAGATAAT TTCTTCTTCT GAATTAACTT ATTTTGTAGA AAAGAATGCT
TTTAATCCTA ATCTTGACTT ATTAACAGAA ATTTATCCCC AGGCTAAAGA AATAATTCTG
GTTCGGGATT TTCGAGATGT AGCTTCTTCA ATATTAGCCT TTAATAAGAA ACGAGGTAAC
GATGGTTTTA GTCGCAATAA GTTTAAAAGT GATGAGGAAT ATATTAAAGG AGTTATGAAA
AATGCGTCAT CTGGTTTGCG GGAGCGGTGG AAGAAGCGCA GACAAGAAGC ATATTTAGTC
CATTACGAAG ATTTAATTCT CTCCCCAGAA AAAACTTTAA AGGGAGCTCT AGAATATTTA
AACTTAGATA ACTCTCAGGA GGCGATCGCT AATATGCTAG AAAAAGCATC GGAAAATACT
GCCTATACCC AATGGCATCA AACTAGCTCT AACGCTAAAG ACTCTATCGG AAGGTGGCGG
CAAGATCTAG AACCTTCCCT GCAAAAATTA TGCAATCGGG TTTTAGCTGA TGCTTTGCGA
GAATTTGGAT ATTCTGTTAA CTAA
 
Protein sequence
MRNRYNETEI PLPQIKMTQN SFIEILQVQK LSQESESLIG FNIDSLQKGK KIDNYSISIV 
GWVLGKGSPA VAVEFLNGGK VLQKVPVSKS RPGVEKRFPK VSEAKSSGFA AAVGVLGLPP
VANLKLRCLL ADGTIVPLAD IKFRHEPLDS GYQPRLQPLV LNSTGRSGST WFMRLLSQHP
AIVACPIYAY ETRVWPFWMQ LLQFLSQRAN PIETPFQADY PPKAAAFCQE TLDAFYEHIA
EAQGKIISSS ELTYFVEKNA FNPNLDLLTE IYPQAKEIIL VRDFRDVASS ILAFNKKRGN
DGFSRNKFKS DEEYIKGVMK NASSGLRERW KKRRQEAYLV HYEDLILSPE KTLKGALEYL
NLDNSQEAIA NMLEKASENT AYTQWHQTSS NAKDSIGRWR QDLEPSLQKL CNRVLADALR
EFGYSVN