Gene Information |
Locus tag | Tery_2974 |
Symbol | |
ID | 4245090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4625247 |
End bp | 4626683 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638108012 |
Product | SSS family solute/sodium (Na+) symporter |
Protein accession | YP_722605 |
Protein GI | 113476544 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.154355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAC AAATTTGGAT TGGGATAACA TTTATAGCCT TTTTGCTATC ATTTACAGTA GTAGGCATTT ACTCCGCAAC ACAAAAGCAA AATACAACAA CTGATTACTT ACTTGCCAGT AGAAATGTTA ATCCCTGGTT GACAGCACTA TCCGCAATGG CAACAGGTCA GAGTGGGTTT CTATTTATTG GTTCGATAGG TTTTATCTAT AAAGTTGGAT TTGCTGCTAT TTGGATACCC CTTGCTTGGA CAATAGGAGA CTATATTGCT TGGTTGTTAA TATTTAAAAG GTTGAGGTTA GTTTCTCAGG AAACAGACTC AGATACAATC TCCTCATTCT TAGGTCAAGA AAATCTAAGT CCAAAAAATC AAGGGCGCTC GATTACAATA ATTTCAGCAC TAATTACCAT AGGAATTCTG GGTACTTATG CTGCAGCTCA ACTGGTAGCA GCAAGCAAAG GACTGAATGC TATATTTGGT TGGAACTATG AACTGGGTAT TATTGCTGGG GCTGTAATTG TGGTTGTCTA CTGTTTTTCA GGAGGTATCC GTGCTTCTAT ATGGACTGAC TCTGTGCAGG GAATTTTAAT GATATTATCT CTGTTGATTT TGTGTATAGT AAGTTTACTG GCTTGTGGAG GGTTGACAGA ACTTTGGGTC AAGCTTAATG CCATTGACCC CACTCTAACA AATTGGATGC CTACTAATTT ACCTTGGGGG TTTTTTCCTT ACTTTTTGGG GTGGTTGGTG TCAGGCTTGG GTGTTGTCGG TCAACCTCAT GTATTAGTAA GAGCAATGGC AATTGACTCT GCAGATAATA TAGCGTTAGC TCGTAACATA AAATTAGTCT GCGGTCTAAT GAATTCGGCT ACAGCTTTTG GTATAGGATT AACTGCCAGA GTTTTGTTAC CTGAATTAAT GACATCTGGT GACCCAGAGT TAGCATTACC GAATCTATCT ATAGAATTAT TGCCAGCAGT TTTAGTAGGG TTGATGTTAG CAGGACTTTT TTCTGCAGCT ATTTCTACAG CAGATTCTCA AATATTATCA TGTTCTGCTG CACTAAGTCA AGATTTAGTT CCCAGTGGAT CTAACTCTTA TCGAAAAGCT AAAATTGCTA CCTTAGCTGT TACTGCTTTT GTATTAGCGA TCGCTCTCAT AACAAACAAT AGTGTATTTG CTTTGGTCAT TTTCTCTTGG TCAGTTTTAG CCTGCGCTTT AGGTCCGTTG TTAGTATTGC GAGTGTGGCA AAAACCTGTA AGGGTTCCAG TCGCAATAAC AATGATGATT ACTGGTATAG TAGTTGCGAT TATATGGAAT AAAGGCTTTA ACCTATCAAG CGCTATTTAT GAAGTCTTGC CTGGTATGGC AGCAGGCTTT ATTGTTTATG GAATTGCTAA TTTGCAAATT TGGCCTAAAG ATTTGAGTAA ACAATAA
|
Protein sequence | MEKQIWIGIT FIAFLLSFTV VGIYSATQKQ NTTTDYLLAS RNVNPWLTAL SAMATGQSGF LFIGSIGFIY KVGFAAIWIP LAWTIGDYIA WLLIFKRLRL VSQETDSDTI SSFLGQENLS PKNQGRSITI ISALITIGIL GTYAAAQLVA ASKGLNAIFG WNYELGIIAG AVIVVVYCFS GGIRASIWTD SVQGILMILS LLILCIVSLL ACGGLTELWV KLNAIDPTLT NWMPTNLPWG FFPYFLGWLV SGLGVVGQPH VLVRAMAIDS ADNIALARNI KLVCGLMNSA TAFGIGLTAR VLLPELMTSG DPELALPNLS IELLPAVLVG LMLAGLFSAA ISTADSQILS CSAALSQDLV PSGSNSYRKA KIATLAVTAF VLAIALITNN SVFALVIFSW SVLACALGPL LVLRVWQKPV RVPVAITMMI TGIVVAIIWN KGFNLSSAIY EVLPGMAAGF IVYGIANLQI WPKDLSKQ
|
| |
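The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon (the gene sequence ends in TAA). The helper name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record for Tery_2974.
gene_length, protein_length = cds_lengths(4625247, 4626683)
print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the result agrees with the Gene Length (1437 bp) and Protein Length (478 aa) fields in the record.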