Gene Information |
Locus tag | RoseRS_3333 |
Symbol | |
ID | 5210310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4180855 |
End bp | 4182075 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640596931 |
Product | major facilitator transporter |
Protein accession | YP_001277644 |
Protein GI | 148657439 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
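The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration and are not part of the record or any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
start_bp, end_bp = 4180855, 4182075
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1221 406"
```

Under these assumptions the result matches the Gene Length (1221 bp) and Protein Length (406 aa) fields.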
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.230737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTTCGT TTGCCGCCCG CGCGCGCGCG ACGATTGCGC GGATTTCGCC ACAGGTGTGG CGCGTGCTGA CGCACAGTTT GCTCTTCGGG CTGGCTGGCA GTATTGCCGA TCTGCTCTTC AACTTCTATC TGGTAAGCCT GGGGTATGGC GCCGACACCG CCGGATTGAT GGCGACCGTC TATCGCGGCG CAGGGGCGCT GCTCGGTCTG CCGCTCGGCA TCCTGATCGA CCGGTTTGGT GCACGAGCGT TGCTGGTGGT GGGCGCTATC GGTTTTGGTA TCGCGTATGC GCTGGTGTTG ATGGTATCGC AACTCTGGGC GCTGATCCTG TTCGTCTTCC TGGCTGGTGC GGCAAATGTG CTGACGCTCA CGGCGGTTGT GCCGCTGCTC ACCGGGATCA CCGACGAGGA GGAACGGGCT GCGGTGTTTG GCATGAATGC GTCAGCCGGA CTGATCATTG GTCTGGTCGG GAGCGGTGTG GGCGGGTTGC TTCCCGGAAC GGCAGCACTC TTCCTGGGAG TGGCGACGAA TGATACTGCC GCCTACCGGA TGGCGCTGTC AATCGTGGTT GTGCTGGGTT GTCTCTCAGC GCTGCCGGTG CTGATCGGAT TCCGCGCCAG GCAACCGGTG TTCTCACCGG CGCCTCTTGT GGCAGCACCC CAACGACACA TGCCGCCAAT GCGCCTGGTG CGCTTTGCAC TTCCCTCGCT CCTGCTCGGC ATCGGCGGTG GGTTGTTCCT GCCATTTCAG AACCTCTTCT TTCGCACTGT CTTCGGACTG AACGACGCGG TCGTCGGTGT GATGCTGGCG ATGGGCGCGC TGGGTATGGG GCTTGGCGCG CTGATGGGTG CGCCAGTAGC CGCCCGTCTG GGGTTGCGCC GGGCTGCCAG CTCCCTGCGC TTCGGGGCGG TATTCGCCGT GACACTGATG TTCGCGCCAG TTTTGCCGGT GGTGGTTGTG GGATACATGT TGCGCGGCGC TTTTGTTGCA GCCAGTTATC CGTTGAATGA TGCGCTGGTG ATGCAGTTGA CCCCGTTACG ACAACGCGGG ATCGCAATCA GTCTCATGAG TGTCCTCTGG TCGCTCGGCT GGTCGGCAGC GGCGTGGATC AGCGGACACA TTCAGGTGCA CTACGGCTTT ACCCCGGTGC TCGCCGCATC GCTCGTGGCG TATGCGCTCT CAGCGTGGGC GATCTGGACG TTGCGGGAGG AGGGGCGGTG A
|
Protein sequence | MFSFAARARA TIARISPQVW RVLTHSLLFG LAGSIADLLF NFYLVSLGYG ADTAGLMATV YRGAGALLGL PLGILIDRFG ARALLVVGAI GFGIAYALVL MVSQLWALIL FVFLAGAANV LTLTAVVPLL TGITDEEERA AVFGMNASAG LIIGLVGSGV GGLLPGTAAL FLGVATNDTA AYRMALSIVV VLGCLSALPV LIGFRARQPV FSPAPLVAAP QRHMPPMRLV RFALPSLLLG IGGGLFLPFQ NLFFRTVFGL NDAVVGVMLA MGALGMGLGA LMGAPVAARL GLRRAASSLR FGAVFAVTLM FAPVLPVVVV GYMLRGAFVA ASYPLNDALV MQLTPLRQRG IAISLMSVLW SLGWSAAAWI SGHIQVHYGF TPVLAASLVA YALSAWAIWT LREEGR