Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2371 |
Symbol | |
ID | 5539852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3057305 |
End bp | 3059119 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894503 |
Product | sulphate transporter |
Protein accession | YP_001432471 |
Protein GI | 156742342 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.364425 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGCAA TGTTGAAACG ACTATCCCTG TTTCAGATTC TGCAACGGGA GTTTTCTACC TACGACATGG CGCATTTTCA GCGCGATCTC CTGGCAGGTT TGACGGTAGC AGCGGTGGGG TTGCCGCTGG CGCTGGCATT CGGCGTGGCG TCCGGCGCCG ACGCGGCGGC TGGTCTCGTG ACGGCGATTC TGGCAGGTTT GATCATTGGC GGACTGGGTG GCGCGGGGTA TCAGATCAGC GGTCCAACCG GCGCGATGTC GGCCGTGCTG ATCGTGCTGG CATCGCGGTA TGGACTGGAC GGCGTGTGGG TGGCGACGGT GATGGCGGGC ATTTTCCTGA TTGCGTTGGG AGTGTTCCGT CTGGGACGCT ATGTTTCGTT CATTCCATCG CCGGTCATTA CCGGCTTTAC GTCTGGAATC GCGCTCATTA TCGCTATCGG GCAGATCGAT AACGTCCTGG GAGTGACGAC GCCGAAAGCC GAGAATGCGC TGGAAAAACT CTGGCATTAT GTGACCAACC CCGTCGTGCC CGACTGGCAC ACCCTGGCGC TAGCCGGCAT CGTCATGGGG ACGATGATCC TCCTTCCGCG GGTGACGACG CGCATTCCCG GCTCGCTCGT CGGCATTATG ATTGCATCGC CGCTCGCCTT CTTTGCCGGA TGGGACGTGG CAGTCATCGG CGATATTCCA CGCACGATTC TGCTGAGCAA TCGCCTGACG CTGGAGAGCA TCCCGTGGGA ACACTTGTCC GAACTGTTGC CCGCCGCTGT GTCGATTGCG GCGCTTGGGG CGATCGAAAG TCTGCTGTGT GGCACAGTTG GAGCCACGAT GTCGGGCGGG GAGTTCGACA GCAATCAGGA ATTGATCGGG CAGGGCATCG GCAACCTGGT GATCCCCTTC TTTGGCGGCG TTCCGGCGAC GGCGGCGATT GCACGCACCA GCGTCGCTAT CAAGAGCGGC GCCGCTACCC GCCTGACCAG CATCATCCAT TCACTGGCGC TGCTGCTCAG CGCATTGGCG CTGGCGCCGC TGATCAGCCA CGTGCCGCTG GCAGCGCTCG GCGACGTGCT GCTCGTGACG GCATGGCGCA TGAATGAGTG GGAGTCGATC CATTTCTTTG TTCGCACTCG TCTGCGAGGC GCTCTCACCG GTATGATCGT GACCATGATC GCCACCGCAG CCCTTGACCT GACCCAGGCA ATTCTGATTG GCGTCGTCAT CTCGGCAGTG CTCTACATTC GCCAGTCAGC AATCAGCACA TCGGTAGCCA GCGGACCGGT GAAGCCAGAG AAGATTCGGG CGCAGGGGTT CGACCTTCCG GCGGCTTGCC CTGCTATTCA TGTCTACTAC CTGACCGGTC CGGTCTTTTT TGGCAGCGTG ACGACCGTGC TCGAGTCGTT CGAGACAGCA GGCGACTATC ATACCCTGAT CATCAGCATG CGTGGCGTGC CGGTGGTCGA TGCAATGGGC ATTCAGGCAT TGCAGCAGAT TGTCGAGGAA CATCACCATC GCGGCGGCAA GGTGTACTTC AGCGGATTGC AGCCAGCCGT GCGGTCGATG TTCGACCGCA CCGGCTTGAC ACAACTGGTG GGGGAGGAAT ATATCTTTTG GGACGCGGCA CAGGCGATTA TTGCCAGCCA TCAGAGCCAC GAACTCGACG GTTGCGCGAA GTGCGACTCG CGCAGTGAGA TCTGTGATGT GCTGCGCGCC GCGCGACGTC GGCATGTGGA GTCGGATGAG GCGGCAGCGC CGGCGGTCGC CACCGACTCG TTCCCATCCA CAAATATGAC ATCGGTGTCG TCGGTACAGG GGTGA
|
Protein sequence | MLAMLKRLSL FQILQREFST YDMAHFQRDL LAGLTVAAVG LPLALAFGVA SGADAAAGLV TAILAGLIIG GLGGAGYQIS GPTGAMSAVL IVLASRYGLD GVWVATVMAG IFLIALGVFR LGRYVSFIPS PVITGFTSGI ALIIAIGQID NVLGVTTPKA ENALEKLWHY VTNPVVPDWH TLALAGIVMG TMILLPRVTT RIPGSLVGIM IASPLAFFAG WDVAVIGDIP RTILLSNRLT LESIPWEHLS ELLPAAVSIA ALGAIESLLC GTVGATMSGG EFDSNQELIG QGIGNLVIPF FGGVPATAAI ARTSVAIKSG AATRLTSIIH SLALLLSALA LAPLISHVPL AALGDVLLVT AWRMNEWESI HFFVRTRLRG ALTGMIVTMI ATAALDLTQA ILIGVVISAV LYIRQSAIST SVASGPVKPE KIRAQGFDLP AACPAIHVYY LTGPVFFGSV TTVLESFETA GDYHTLIISM RGVPVVDAMG IQALQQIVEE HHHRGGKVYF SGLQPAVRSM FDRTGLTQLV GEEYIFWDAA QAIIASHQSH ELDGCAKCDS RSEICDVLRA ARRRHVESDE AAAPAVATDS FPSTNMTSVS SVQG
|
| |