Gene Tery_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1539 
Symbol 
ID4242018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2336377 
End bp2337459 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content34% 
IMG OID638106683 
Productglycosyl transferase, group 1 
Protein accessionYP_721293 
Protein GI113475232 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.29057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAATA AAGTTCTTAT TAACTTATCA GTATTAATTC CCCAACCTAC AGGAATATCT 
GTCTATGCTA ATAATATTTT GCCACAGCTT CAGCCACTTA ATCCTATTCT ATTAGTTGCT
AATAATATTC CCAAATTTAA CTGCCATCTT ATCCCTAATA ATATGACTCC CGCTCAGGGA
ACTATAGGTC ATTTTCGACG ATTATTTTGG ACTCAATTTC AATTATCAAA TATATCTAAA
AAATTACAAG CAAATCTAAT TTTTTCTCCC CTTCCAGAAG CGCCATTATT TTCTGAATGT
CCTTATATTG TCATGGCTCA TGATTTAATT CCTTTGCGTT TTCCTAAACC TGGTTCTCGC
TTAACTGCTT ATTTCAAATA TTACATTCCC CAAGTATTAA GTCAGGCAAA ACATATTATT
TGTAATTCCC AAGCTACGGC AAAAGATATT ACTGATTTTT TCAAAATATC ACCTCAAAAA
ATTACTCCTA TTCCTCTAGG TTATGATTCT CAAAGATTCC AATTTTTAGA TTTACCGACA
AAAAATTATT TTCTTTATCT CGGTCGGCAC GACCATTATA AAAATTTGCA TCGGTTGATC
GAAGCTTTTA CTAATTTGCC AAATTTTTCT GAGTATGAAT TATGGTTTGC CGGACCTACA
GATAATATTT ATACACCAAC ATTAAAAACT CAAATCAAAG AACTAGGTTT AACAAATTTA
GTGAAATTTC TTGATTATGT TTCTACAGAA GAATTACCCA AAATAATTAG TCAGGCGATC
GCTATGGTTT TTCCTAGTTT GTGGGAAGGG TTTGGTTTTC CAGTATTGGA GGCGATGGCT
TGTGGTACTC CTGTTATTAC TTCTAATATT TCATCTCTGC CAGAAGTTGC TGGAGATGCT
GCTATTTTGG TTAACCCGAA AAATGTGGGA GAAATAACTG ATGCGATGAA TATTATTGCT
CAAGATGGGG GGGAGCGATC GCGTCTGATG AGCTTAAGTT TGGCTAGAGC TAAAGAGTTT
AGTTGGGAAA AAACTGGTTT AGCTACTAGG GAGATAATTC AGCAATTTTC TGTATCAGGT
TGA
 
Protein sequence
MSNKVLINLS VLIPQPTGIS VYANNILPQL QPLNPILLVA NNIPKFNCHL IPNNMTPAQG 
TIGHFRRLFW TQFQLSNISK KLQANLIFSP LPEAPLFSEC PYIVMAHDLI PLRFPKPGSR
LTAYFKYYIP QVLSQAKHII CNSQATAKDI TDFFKISPQK ITPIPLGYDS QRFQFLDLPT
KNYFLYLGRH DHYKNLHRLI EAFTNLPNFS EYELWFAGPT DNIYTPTLKT QIKELGLTNL
VKFLDYVSTE ELPKIISQAI AMVFPSLWEG FGFPVLEAMA CGTPVITSNI SSLPEVAGDA
AILVNPKNVG EITDAMNIIA QDGGERSRLM SLSLARAKEF SWEKTGLATR EIIQQFSVSG