Gene Tery_0459 details


Gene Information

Locus tag: Tery_0459
Symbol:
ID: 4243753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 727178
End bp: 728614
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 35%
IMG OID: 638105776
Product: PUCC protein
Protein accession: YP_720390
Protein GI: 113474329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
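
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(727178, 728614)
print(length)                  # 1437
print(protein_length(length))  # 478
```

Both values agree with the Gene Length and Protein Length fields reported in this record.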


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.222585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAGTA AAGATTTAAA AGGTACAAAA ATCGATAATA TTGATATGGA AAAAAATTTT 
CCTAAACTTA ATTTATTCAC TATGTTTAGA TTGGGTTTCT ATCAAATGGG ATTAGGAACA
ATGTCAGTTT TAACTCTTGG AGTTTTAAAC CGAGTTATGA TTGCTGAATT AAAAATTCCT
GCCACAATAG TAGCCATAAC TTTATCGTTA TATCAATTTA TGGCTCCAGC AAGGGTTTGG
TTTGGTCAAA TGTCTGATAC TAAGCCTTTG CTTGGTAAAC ATCGCACTGG TTATATGTGG
ATAGGAGCGG TTTTTTTTAC GGTAACTGCT TTTTTTGCTG TGCAAGCTAT GTGGCAGGTT
GGTGATAGTT TAAGGGTTAA TGGTTGGTCT AATTCTACTT ATTTTTGGAT TGGTATTTTA
GGCTTAATGT TTATATTTTA TGGTTTGGCT TTAAGTGCAA GTTCTACACC TTTTGCTGCT
TTATTAGTAG ATATTTCTGA TGAAGATAAT CGTTCTAAGG TTGTCGGTAT AGTCTGGTCA
ATGTTGATGG TGGGAATTAT TATTGGGGCT ATTACTAGTA GTTTTTTATT GAAACAAGTT
GGTGTGGATG CTCCGTTAGA AACTGTTCAA GTTTCTATTA ATAATTTGTT TATTATAATA
CCGGCAATTG TTTTGGTTTT TGCTTTTATT GGTACTGTGG GAGTTGAGGA AAAATACTCT
CGTTATGGCA GTCGTTCAAC TATTGCTAAC CGGGAGGATC AAGTTACTAT GGGAACAACT
TTAAAAATTT TAAAGGCTAA TAGACAAACT GGTTTGTTTT TTACTTTTGT GTTTGTATTA
ATTATTAGTT TGTTTATGCA GGATGCTGTT TTGGAGCCTT ATGCTGGGGA AATATTTCTG
ATGCCAATTT CTGAAAGTAC TAGGTTGAAT GCTGTTTCGG GAATAGGAAC TTTAATTGGG
TTAGGTACAA CGGGTTTTTT AGTAGTACCA AGGTTGGGAA AGAAAAATTC TTTGAAGGTT
GGATGTGTGG CAACAACAAT TAGTTTTATT TTGATAGTTA TGTCTGGGTT TACTGGTAAG
CTTTCTTTGT TTTTAAGTGC TTTATTTTTG TATGGTTTGG CTGCAGGTTT AACTACTACT
GCTGCTCTTA GTTTAATGTT AGATTTAACG GCGGCGGAAA CGGCGGGAAC TTTTATTGGG
GCTTGGGGTT TAGCACAGGC AATGGCACGA GGTTTATCTA CGGTTATTGG TGGTGTGACT
TTGGATGTTG GTCGCCGTTT ATTTAATGTT TCAATGTTGG CTTATGGGTT GGTTTTTCTA
TTAGCAGGAG TGGGAATGAT ACTTTCTATT TTCTTGCTCA ATAGGGTTAA TGTTAGAGAA
TTTCAAGATA ATGCTAGTGT GGCGATCGCT ACTATTTTAG AGGGAGAATT AGATTAG
 
Protein sequence
MTSKDLKGTK IDNIDMEKNF PKLNLFTMFR LGFYQMGLGT MSVLTLGVLN RVMIAELKIP 
ATIVAITLSL YQFMAPARVW FGQMSDTKPL LGKHRTGYMW IGAVFFTVTA FFAVQAMWQV
GDSLRVNGWS NSTYFWIGIL GLMFIFYGLA LSASSTPFAA LLVDISDEDN RSKVVGIVWS
MLMVGIIIGA ITSSFLLKQV GVDAPLETVQ VSINNLFIII PAIVLVFAFI GTVGVEEKYS
RYGSRSTIAN REDQVTMGTT LKILKANRQT GLFFTFVFVL IISLFMQDAV LEPYAGEIFL
MPISESTRLN AVSGIGTLIG LGTTGFLVVP RLGKKNSLKV GCVATTISFI LIVMSGFTGK
LSLFLSALFL YGLAAGLTTT AALSLMLDLT AAETAGTFIG AWGLAQAMAR GLSTVIGGVT
LDVGRRLFNV SMLAYGLVFL LAGVGMILSI FLLNRVNVRE FQDNASVAIA TILEGELD
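
The relationship between the gene sequence and the protein sequence can be sketched in code. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is given in the coding orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI "TCAG" order; amino-acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACAAGTA AAGATTTAAA AGGTACAAAA ATCGATAATA TTGATATGGA AAAAAATTTT"
print(translate(first_line))  # MTSKDLKGTKIDNIDMEKNF
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.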