Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1058 |
Symbol | |
ID | 4241943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1654059 |
End bp | 1655711 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638106290 |
Product | sulphate transporter |
Protein accession | YP_720902 |
Protein GI | 113474841 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.161168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.396315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTTG TTAACAGTTT ACATTTCAAT AACCTGCGCG GTGATTTATT GGGTGGTCTG ACTGCGGCAA TTGTTGCACT ACCTCTAGCT TTAGCATTCG GGGTATCTTC CGGTGCTGGA GCAATCACAG GTCTCTACGG TGCCATTTTT GTTGGCTTTT TTGCTGCTCT TTGTGGAGGT ACACCCTCAC AGATTACCGG TCCGACTGGT CCTATGACCG TACTCATGGC AACGGTTTTT ACCACTCTGC TCGCTGATAA CCCAGATGCC GGTCTGGAGA TGGCATTTAC CGTTGTCATG CTCGGCGGAA TATTCCAGAT ACTTTTTGGG GTTCTTCAGC TAGGAAAATA TATTGTCCTA ATTCCCTACG CTGTAATCTC AGGATTTATG TCCGGAGTCG GTGTAATTAT TATCATCATT CAAATCGGCC CGTTTCTCGG TCATCCGGCC TCTGCTAGTG TAGCTCAATC TATCAAGAAA ATCCCAGAAT TTTTAATTAA TCTTAATCCC GCTGCAGTAG GGTTAGGAAT TCTCACAATA GTAATTCTTT TATTCACACC ACGTAAAGTA ACTGCGATTA TACCGTCTCC ATTGTTAGCA TTATTAACTG GAACTTTGAT TTCGGTATTC TTCTTGTCAG ACAGTAATCT TATACTTATT GGAGAAATCC CTACTGGTTT GCCCAAGCTA CACTTACCAG TATTTACCTT CAATCAACTA CAAAATATGC TTGTGGATGG TTTGGTACTA GGAACCCTAG GGTCTATTGA CTCCCTACTC ACCTCTCTGG TTGCTGATAA CATTACCCGA AGCAATCATG ACTCCGATCA CGAATTAATT GGTCAAGGAA TTGGAAATAT AATGTCTGGA TTATTTGGTG GACTACCAGG AGCGGGAGCA ACAATGCGGA CTGTTGTTAA CGTTCATGCT GGTGGAAAAA CTCCCTTATC GGGTATTGTC CATTCTATTA TTCTATTACT TATCCTTTTG TGGGCTGGAA AGTTGACTGA AGCAATTCCT CAAGCTATCC TCGCAGGTAT CTTACTCAAA GTCGGAGTTG ATATTATTGA CTGGGGATTT CTCAAACGAG CACATAATTT ATCACTCAAA GCTGCTGGCA TTATGTACAG CGTATTGATA CTGACTGTAT TCGTTAATCT AGTTACTGCA GTCGCAGTTG GTATATTTAT TGCAAATCTC TTGACTGTCA AACGTCTTAG TGATATGCAA ATTAATGATA TTAAGGCTAT AGTTGAACCG AATGATGAAA TTCCTTTACG GGATCAAGAA AAACAACTAC TTAAGGAATG TCGGGGTCAT CTTCTTCTAT TACATCTTGG TGGACCAATG AGTTTTGGGG CTGCCAAGGC TCTCTCTAGA CACATGGGGA TGGTACAACA GTATGATGTT CTTATTCTTG ACCTCAGCGA TGTTTCCTAC CTTGGAATTA CTATGTCCCT TGCTCTTGAA AATATGGTTA CAGAAGCAAG TCGAAAACAT CGTGAGGTGT TTATTGTAGG TGCATCTGGT GGAGTTAAAA CTAGACTGGG AAAGTTAAAG ATATGGAACT TTGTACCTCG ACAGAACCTA GTAGGAACAC GCATCAAGGC TTTGCAACAG ACACTTAACC TCCTGGTTGA ACGAAAAATT TAA
|
Protein sequence | MQLVNSLHFN NLRGDLLGGL TAAIVALPLA LAFGVSSGAG AITGLYGAIF VGFFAALCGG TPSQITGPTG PMTVLMATVF TTLLADNPDA GLEMAFTVVM LGGIFQILFG VLQLGKYIVL IPYAVISGFM SGVGVIIIII QIGPFLGHPA SASVAQSIKK IPEFLINLNP AAVGLGILTI VILLFTPRKV TAIIPSPLLA LLTGTLISVF FLSDSNLILI GEIPTGLPKL HLPVFTFNQL QNMLVDGLVL GTLGSIDSLL TSLVADNITR SNHDSDHELI GQGIGNIMSG LFGGLPGAGA TMRTVVNVHA GGKTPLSGIV HSIILLLILL WAGKLTEAIP QAILAGILLK VGVDIIDWGF LKRAHNLSLK AAGIMYSVLI LTVFVNLVTA VAVGIFIANL LTVKRLSDMQ INDIKAIVEP NDEIPLRDQE KQLLKECRGH LLLLHLGGPM SFGAAKALSR HMGMVQQYDV LILDLSDVSY LGITMSLALE NMVTEASRKH REVFIVGASG GVKTRLGKLK IWNFVPRQNL VGTRIKALQQ TLNLLVERKI
|
| |