Gene Tery_2047 details


Gene Information

Locus tag: Tery_2047
Symbol:
ID: 4243651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3192988
End bp: 3194634
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 40%
IMG OID: 638107158
Product: Na+/solute symporter
Protein accession: YP_721761
Protein GI: 113475700
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
            [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily
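
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3192988, 3194634)
print(length)                  # 1647
print(protein_length(length))  # 548
```

Both values agree with the Gene Length and Protein Length fields of this record.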


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.91078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.880017
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTACC TCTACATTGG ATGGCGATCG CGAGTCCAAG ATAGTAAAGG TTTTTTTGTT 
GCAGATCAAG GTGTGCCTGC AATTGCTAAT GGTGCTGCTA CTGCAGCGGA CTTTATGTCG
GCAATTTCAT TTATTTCTAT AGCAGGGGCA GTATCAATTT TAGGTTCTGA TGGTTCTTAT
TATGTAGCAG CAGGAAGTGG AGGTTATGTA CTCTTGGGAC TGCTGCTGGC TCCATATCTG
CGAAAGTTTG GTAAATATAC TTTACCAGAT TTTATAGGCG ATCGCTACTA CTCTAATGTT
GCTCGTATTA TTGCAGTTAT TGCTGCCCTC ATTATATCTA TAACTTTTAT TGTCGGGCAA
ATGCGAGGTG TTGGTATTGT CTTCAGTAGA TTTTTGCAAG TCCCTATTGA AGTTGGTGTT
GTTATCGGCA TGGTTATAGT TGCCTTCTTT GCCATCTTAG GAGGAATGAA GGGCATTACT
TGGACTCAGG TTGCTCAATA TTTTATACTT ATTGTTGCTT ATTTGATACC AGCTATTGCT
CTTGCTAATA CTCTGACAAA TATTCCGGTT CCACAGTTAG CTTTTACCTT TAGTGATATT
GCAGAAAAAT TGAATCAAGT TCAGGTTGAC TTGGGTTTTC CGGAATATAC TGCTGCTTTC
ACTCAAAAAA CTATGCTAGA TGTTCTATTT ATAACTATTT CTGGCATGGT TGGTCTTGCT
AGTTTACCCC ACGTTATTGT TCGTTTCTAT ACTGTACCTA ATTTGACAGC AGCTAGATAT
TCTGTAGGTT GGGCGTTGTT GTTTATTGCT GTTTTTGCTA CAACTGTTCC GGCTTTAGCT
GTTTCTGCCC GATACAATTT AATTGATACT TTACACAATA CAACTATAGA GGAGGTGCAA
AATTTAGACT GGGCAACAAA GTGGGAAAAT ACGGGTTTGT TAGAATTCAG GGATAAAAAT
AATGATGGTC GTTTGCAATT AACTCCAGAT ATGGAGACTA ATGAAATCAT TATTGACCCC
GATATTATTA CTCTCTCTAC TCCAGAAGTG GCTCAACTTC CTCCTTGGGT AATTGCTTTG
GTGGCAGCAG GAGGAGTCGC TGCTGCTTTG TCTACGGCAT CGGGTTTGTT GTTGGTAATT
TCTAGTGCTA TTGCTCACGA TATTTATTAC CGTTTAATTA ACCCAGAGGC GTCAGAGTCA
CAAAGGTTAA TGTTGGGGAG AATAATGGTG GTATTGGCGA TCGCTATTGC TGGTTATTTC
GGCATTAACC CCCCCGGTTT TGTAATTGAG ATAGCAACTT TGGGAGTTGG TGTCGCTGCT
GGTACTTTTT TTCCAGCAAT TATTTTGGGA ATTTTTGATC GGCGCACTAA CCGAGAAGGG
GCGATCAGTG GTATGATATT TGGTTTGGTG TTTACAACTA TTTATATCAT AGGTACCAGG
TTTGCGGGAA TGCCAACATG GTTTTTTGGG ATATCTGATC AGGGTATCGG TACAGTAGGA
ATGTTGTTGA ATTTTGTTGT GAGTTTGGTA GTGTCTCGGA TGACAAGTCC TCCACCTTTG
GAAATACAAA AGATAGTGGA AGATTTACGA TCGCCTTTGG CTGCACCTGC TCCTCTTCAG
GATATTGGAG AAGAACAGTT AGATTAA
 
Protein sequence
MLYLYIGWRS RVQDSKGFFV ADQGVPAIAN GAATAADFMS AISFISIAGA VSILGSDGSY 
YVAAGSGGYV LLGLLLAPYL RKFGKYTLPD FIGDRYYSNV ARIIAVIAAL IISITFIVGQ
MRGVGIVFSR FLQVPIEVGV VIGMVIVAFF AILGGMKGIT WTQVAQYFIL IVAYLIPAIA
LANTLTNIPV PQLAFTFSDI AEKLNQVQVD LGFPEYTAAF TQKTMLDVLF ITISGMVGLA
SLPHVIVRFY TVPNLTAARY SVGWALLFIA VFATTVPALA VSARYNLIDT LHNTTIEEVQ
NLDWATKWEN TGLLEFRDKN NDGRLQLTPD METNEIIIDP DIITLSTPEV AQLPPWVIAL
VAAGGVAAAL STASGLLLVI SSAIAHDIYY RLINPEASES QRLMLGRIMV VLAIAIAGYF
GINPPGFVIE IATLGVGVAA GTFFPAIILG IFDRRTNREG AISGMIFGLV FTTIYIIGTR
FAGMPTWFFG ISDQGIGTVG MLLNFVVSLV VSRMTSPPPL EIQKIVEDLR SPLAAPAPLQ
DIGEEQLD
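
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The sketch below strips the layout and translates the first line of the gene sequence. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 for internal codons; the table-11 alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean_sequence` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGTTGTACC TCTACATTGG ATGGCGATCG CGAGTCCAAG ATAGTAAAGG TTTTTTTGTT"
print(translate(clean_sequence(first_line)))  # MLYLYIGWRSRVQDSKGFFV
```

The output matches the first 20 residues of the protein sequence listed in this record.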