Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2047 |
Symbol | |
ID | 4243651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3192988 |
End bp | 3194634 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638107158 |
Product | Na+/solute symporter |
Protein accession | YP_721761 |
Protein GI | 113475700 |
COG category | [R] General function prediction only |
COG ID | [COG4147] Predicted symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR03648] probable sodium:solute symporter, VC_2705 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.91078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.880017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTACC TCTACATTGG ATGGCGATCG CGAGTCCAAG ATAGTAAAGG TTTTTTTGTT GCAGATCAAG GTGTGCCTGC AATTGCTAAT GGTGCTGCTA CTGCAGCGGA CTTTATGTCG GCAATTTCAT TTATTTCTAT AGCAGGGGCA GTATCAATTT TAGGTTCTGA TGGTTCTTAT TATGTAGCAG CAGGAAGTGG AGGTTATGTA CTCTTGGGAC TGCTGCTGGC TCCATATCTG CGAAAGTTTG GTAAATATAC TTTACCAGAT TTTATAGGCG ATCGCTACTA CTCTAATGTT GCTCGTATTA TTGCAGTTAT TGCTGCCCTC ATTATATCTA TAACTTTTAT TGTCGGGCAA ATGCGAGGTG TTGGTATTGT CTTCAGTAGA TTTTTGCAAG TCCCTATTGA AGTTGGTGTT GTTATCGGCA TGGTTATAGT TGCCTTCTTT GCCATCTTAG GAGGAATGAA GGGCATTACT TGGACTCAGG TTGCTCAATA TTTTATACTT ATTGTTGCTT ATTTGATACC AGCTATTGCT CTTGCTAATA CTCTGACAAA TATTCCGGTT CCACAGTTAG CTTTTACCTT TAGTGATATT GCAGAAAAAT TGAATCAAGT TCAGGTTGAC TTGGGTTTTC CGGAATATAC TGCTGCTTTC ACTCAAAAAA CTATGCTAGA TGTTCTATTT ATAACTATTT CTGGCATGGT TGGTCTTGCT AGTTTACCCC ACGTTATTGT TCGTTTCTAT ACTGTACCTA ATTTGACAGC AGCTAGATAT TCTGTAGGTT GGGCGTTGTT GTTTATTGCT GTTTTTGCTA CAACTGTTCC GGCTTTAGCT GTTTCTGCCC GATACAATTT AATTGATACT TTACACAATA CAACTATAGA GGAGGTGCAA AATTTAGACT GGGCAACAAA GTGGGAAAAT ACGGGTTTGT TAGAATTCAG GGATAAAAAT AATGATGGTC GTTTGCAATT AACTCCAGAT ATGGAGACTA ATGAAATCAT TATTGACCCC GATATTATTA CTCTCTCTAC TCCAGAAGTG GCTCAACTTC CTCCTTGGGT AATTGCTTTG GTGGCAGCAG GAGGAGTCGC TGCTGCTTTG TCTACGGCAT CGGGTTTGTT GTTGGTAATT TCTAGTGCTA TTGCTCACGA TATTTATTAC CGTTTAATTA ACCCAGAGGC GTCAGAGTCA CAAAGGTTAA TGTTGGGGAG AATAATGGTG GTATTGGCGA TCGCTATTGC TGGTTATTTC GGCATTAACC CCCCCGGTTT TGTAATTGAG ATAGCAACTT TGGGAGTTGG TGTCGCTGCT GGTACTTTTT TTCCAGCAAT TATTTTGGGA ATTTTTGATC GGCGCACTAA CCGAGAAGGG GCGATCAGTG GTATGATATT TGGTTTGGTG TTTACAACTA TTTATATCAT AGGTACCAGG TTTGCGGGAA TGCCAACATG GTTTTTTGGG ATATCTGATC AGGGTATCGG TACAGTAGGA ATGTTGTTGA ATTTTGTTGT GAGTTTGGTA GTGTCTCGGA TGACAAGTCC TCCACCTTTG GAAATACAAA AGATAGTGGA AGATTTACGA TCGCCTTTGG CTGCACCTGC TCCTCTTCAG GATATTGGAG AAGAACAGTT AGATTAA
|
Protein sequence | MLYLYIGWRS RVQDSKGFFV ADQGVPAIAN GAATAADFMS AISFISIAGA VSILGSDGSY YVAAGSGGYV LLGLLLAPYL RKFGKYTLPD FIGDRYYSNV ARIIAVIAAL IISITFIVGQ MRGVGIVFSR FLQVPIEVGV VIGMVIVAFF AILGGMKGIT WTQVAQYFIL IVAYLIPAIA LANTLTNIPV PQLAFTFSDI AEKLNQVQVD LGFPEYTAAF TQKTMLDVLF ITISGMVGLA SLPHVIVRFY TVPNLTAARY SVGWALLFIA VFATTVPALA VSARYNLIDT LHNTTIEEVQ NLDWATKWEN TGLLEFRDKN NDGRLQLTPD METNEIIIDP DIITLSTPEV AQLPPWVIAL VAAGGVAAAL STASGLLLVI SSAIAHDIYY RLINPEASES QRLMLGRIMV VLAIAIAGYF GINPPGFVIE IATLGVGVAA GTFFPAIILG IFDRRTNREG AISGMIFGLV FTTIYIIGTR FAGMPTWFFG ISDQGIGTVG MLLNFVVSLV VSRMTSPPPL EIQKIVEDLR SPLAAPAPLQ DIGEEQLD
|
| |