Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0366 |
Symbol | |
ID | 3774887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 357549 |
End bp | 359732 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637798772 |
Product | putative sulfate transporter |
Protein accession | YP_399385 |
Protein GI | 81299177 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.972082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.273026 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTGGG TAGGGCAACG GCTGATCGCT GATCTGAAGT CAGCACCGTT GCGCAACATC TTGGCAGGAC TTGTCTCAGG TCTATTGCTA CTGGGTGAAA ACATTTCTGC CGGCTTGCTG ATATATAGCA ATGTCCTGAG TCCCTACTTG GCTTCAGGTC TGGCAGCACT CTTGATCAGT ACGGCCTGTC TCAATTTTTT TGCTGTGAAT CGCCGTGGGT TGCCCTACGG TTTGGCGACG CCGGACTCAC GGATCTACTC TTTGTTGGCT GTGATCGCCG CAGCCATCAG TCAGTCAGAA GCCTTGGCAG ACACCCGCAT GTTGGGCGTG ACCCTGTTGA TCTACTGGGT TGTGGCCACA GCCTGTATTG GGGTGGCGCT TTGGTTAATG GGTCGCTGTG GAATTGGCGA ATGGTTGCGC TACATTCCGT ACCCGGTTGT GGGGGGCTTT TTAGCGGCGA CGGGTTGGCT GCTACTGACC GGGGGGCTGA ATGTTGCCTT GGGCTTCAAG ATTTCTGGGG CAACCTTAGA ACGCCTCTGG AGCGTGGATT CGGCTGCCAA AATCCTGGTT GCAGTCGTCT TTGGCTTGTT GCTCTGGCAA TTGGGCACGC GATCGAAGCG TGTTTGGATT ACGCCAGTCC TGCTGCTAGC TGGCTTGGTA GGTGCACAGT TGGTGCGGGT TGGTTTGGGC TACTCCCTTG CAGAGGCAGC GCAATTCGGC TGGTTTTTGC CGCCGCTTGC CCCTCGCATC GTCTTCTTTT GGGATTGGCC CCGAATTCCT AGCATCGACT GGGCTTTGCT GCTGCCGCAG ATTGCCCTGA TCCCAGTTCT GATTTTGCTG GCGGCAATCT CGCTAACGCT CAATCTCAGT AGCCTCGAAA ACGTTGAAAA ACGAGAGCTG GATCTCAATC AGGAATTGCA AAGAACGGGG CTTGCAAACC TGATTATGTT GCCGTTCGGG GGCTTTTCCC CCGGCTTACT CTCCGGTAGC CGCAGTACCT TAAACCGCTT GGCTGGTGCC TCAACTCCTC TGGCGGCTGG GGTTGCTGCG GTTCTAATGT TGATTGCGAT CGCCCTTGGA GACTTAGCGT CTTGGTTGTG TAAGCCCATC TTGGCGGGTC TCCTGGTTAA CATCGGATTG GTCTACATCG ATCGCTGGGT GTTTCGCTCG ATTCGACTGC TACCTCAACG GGAATATTGG GTTACGATCG CGATTTTAGG TATTAGTATT GTCGGAGGAT TTCTAGAGGC AATTACCGCC GGCATTGTTT TCAGTAGCTT GACCTTTGTT GTTAGTTATT GTCGTGCTCC TGCCATCCGA TCAATGGCGA CTGGTGCCTA TCTCCACAGC AATATTGAGC GGGTAGAAAC GCAACAATCA ATTCTGCGGC GGCGCGGTGC TGTGCTCCAA GTGGTGTCGC TCCAAGGCTA CCTATTCTTT GGAACGGCTC GCCGGATTGT GTCAACAGTT CGCGATCGCC TGCAGCAAAG TAAAGTTTCA ATCAACTTAG TGCTGTTGGA CTATCAAGCC GTGACAGGAG CGGATTCCTC AACGGCTGAA GTCTTCAGCC GCTTTGCCGT GGAATTGCAA GCGCAATCGA TTCAACTTTG GGTGGCGGCG ATCGCCAGCG ACTATCAGAA ACCCCTGCGA CCATTTTTGC AGCTACTACC CGAGCATCAG CAATTTCCTG ATTTGAATAC AGCCTTACAA GCTGCGGAAG CCAAACTGCT GGATCAGTAT GCAAGACGGC CCAAAAGTTT ACCGTTTGCT TTGCTCTTAC CAGAATTATT AGCACTTCCC GATGACTGTG ATGTCCCACT GCAGTTCTGG CAGCGATCGC AACTGAAAGA CGGTGAAGTG CTCTATCAAC AGGGCGATCG CGCGGATACT ATTTACTGGC TAGAACGCGG TGAATTGCAT CTGCAAGCCA CTGAGGCATG GGATGCCCGT CAAGTGCTGG CAGGCAGCCC CTGCGGTGAA CTCGCGTTTT TACGCGGAGA TACCCAGCCT CAAACAGCGA TCGCCGTGGG TCGATGTATG GTCTACGGGC TAAACCGTGC GGCTTTGACT GAACTTGAGG CCAGCTATCC CACTGTTGCG ATCGCGCTCT ACCGTTGGCT ACTGCACCGC CAGAGTGAGC AATTGCAGGA TCAACAACTG CGGCAGCACT ACTTGCAGGC CTAG
|
Protein sequence | MAWVGQRLIA DLKSAPLRNI LAGLVSGLLL LGENISAGLL IYSNVLSPYL ASGLAALLIS TACLNFFAVN RRGLPYGLAT PDSRIYSLLA VIAAAISQSE ALADTRMLGV TLLIYWVVAT ACIGVALWLM GRCGIGEWLR YIPYPVVGGF LAATGWLLLT GGLNVALGFK ISGATLERLW SVDSAAKILV AVVFGLLLWQ LGTRSKRVWI TPVLLLAGLV GAQLVRVGLG YSLAEAAQFG WFLPPLAPRI VFFWDWPRIP SIDWALLLPQ IALIPVLILL AAISLTLNLS SLENVEKREL DLNQELQRTG LANLIMLPFG GFSPGLLSGS RSTLNRLAGA STPLAAGVAA VLMLIAIALG DLASWLCKPI LAGLLVNIGL VYIDRWVFRS IRLLPQREYW VTIAILGISI VGGFLEAITA GIVFSSLTFV VSYCRAPAIR SMATGAYLHS NIERVETQQS ILRRRGAVLQ VVSLQGYLFF GTARRIVSTV RDRLQQSKVS INLVLLDYQA VTGADSSTAE VFSRFAVELQ AQSIQLWVAA IASDYQKPLR PFLQLLPEHQ QFPDLNTALQ AAEAKLLDQY ARRPKSLPFA LLLPELLALP DDCDVPLQFW QRSQLKDGEV LYQQGDRADT IYWLERGELH LQATEAWDAR QVLAGSPCGE LAFLRGDTQP QTAIAVGRCM VYGLNRAALT ELEASYPTVA IALYRWLLHR QSEQLQDQQL RQHYLQA
|
| |