Gene Information |
Locus tag | RoseRS_1778 |
Symbol | |
ID | 5208737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2195829 |
End bp | 2197253 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595386 |
Product | major facilitator transporter |
Protein accession | YP_001276118 |
Protein GI | 148655913 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
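The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2195829, 2197253)
protein = protein_length_aa(length)
print(length, protein)  # 1425 474
```

Under these assumptions the computed values agree with the listed gene length of 1425 bp and protein length of 474 aa.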
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000929318 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0431251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGCA ACGAAGAACG TACCCGGAAC CGCATCCTGC TTGTGCTGTT TGTTGGTGTT CTGATGGCAG CGCTTGATAT TGCCATCGTT GGACCAGCGC TGCCGGCGTT GCGCGAGCAT TTTGGAATCG ACGCGCGTGC GGCATCGTGG ATGTTCGTCG TCTACGTGCT CTGCAACCTG GTTGGAACGC CGTTCATCGC CAAACTGTCG GACCGGCTGG GGCGGCGCAC GCTCTACACT GCCTGTGTTG CCCTGTTTGG TCTGGGGTCA TTGATCGTCG TGGCGGCGCC GACGTATGCG CTGGTGCTGG CGGGTCGCGC TATCCAGGGG TTGGGCGCGC GTGGCATTTT CCCGGTTGCC AGTGCGGTGA TCGGTGATAC ATTTCCGCCG GAGAAGCGCG GCAGCGCGCT GGGACTGATC GGCGCGGTGT TTGGTATTGC ATTCCTGATC GGACCGATCA TCGGCGGTGT GCTGCTCCTG CTTGGATGGC AGTGGCTCTT TCTGATCAAT CTGCCGATTG CACTTGCGCT GATCGGGTTT GGCGTGAAAT TATTGCCCGC CATCCGCGCG GCAACGCCGC GTCCCTTCGA CTGGGGCGGG ACGGTCGTGC TTGGGGTGAT CCTGGCATCG CTGGCTGTGG CGCTGAGCGA TCTTGCCTAC CTGCTCGACG AAGCGAGTGT GAGCGGTCTG GTGAATGCAA TCAGTGCATC ATCGACCTGG TTCCTGCTGG TGCTGGCACT GGCGCTCATC CCGCTATTCT TGCAGATCGA GCGTCGTGCG GATGACCCGG TGCTTGACCT GAATCTGTTC CGCAACTGGC AGATTGCGCT GGCTGGCGCA CTCTCCTTTG GCGCAGGCTT GAGCGAGGCG GTAACGTTGT TCGTGCCATC GTTGCTTGTC GCAGCGTTTG GCGTCACGCC ATCAACTGCG AGTTTTATGC TTGTGCCGAT GGTGCTGGCG ATGGCGGTCG GTTCACCGCT GTCGGGGCGC ATGCTTGACC GGATTGGCTC GAAAATTGTG GTGCTGACCG GTACGGCGTT GATAGCAACA GGTTTGCTGC TGGAAGGGAT GCTGGCAACC TCTCTCGTCG CGTTCTACGG CTTTGCTGCG CTGTTTGGCA TTGGTATCGG CGTATTGCTC GGCGCATCGC TCCGGTACAT CCTGTTGAAC GAAGCGCCAG CTGCGGAACG TGGCGCGACG CAAGGGGTGC TGACGGTATT TATCAGCATT GGTCAGTTGA TCGGCGCGGT GGTGCTCGGC GCGGTTGCAG CAGCGCGCGG TAGCGATGTC GGCGGATACG CAGCGGCGTT TCTGGTTGTC GGCGTCGTGA TGCTGGCGCT CTTCATCGCT TCGTTCGGTT TGAAGAGTCG CGCCGAGGAA CTGGCGACCC GCCAGCAATG GCAGAGCGGA GCTTCGGCGG CATGA
|
Protein sequence | MTRNEERTRN RILLVLFVGV LMAALDIAIV GPALPALREH FGIDARAASW MFVVYVLCNL VGTPFIAKLS DRLGRRTLYT ACVALFGLGS LIVVAAPTYA LVLAGRAIQG LGARGIFPVA SAVIGDTFPP EKRGSALGLI GAVFGIAFLI GPIIGGVLLL LGWQWLFLIN LPIALALIGF GVKLLPAIRA ATPRPFDWGG TVVLGVILAS LAVALSDLAY LLDEASVSGL VNAISASSTW FLLVLALALI PLFLQIERRA DDPVLDLNLF RNWQIALAGA LSFGAGLSEA VTLFVPSLLV AAFGVTPSTA SFMLVPMVLA MAVGSPLSGR MLDRIGSKIV VLTGTALIAT GLLLEGMLAT SLVAFYGFAA LFGIGIGVLL GASLRYILLN EAPAAERGAT QGVLTVFISI GQLIGAVVLG AVAAARGSDV GGYAAAFLVV GVVMLALFIA SFGLKSRAEE LATRQQWQSG ASAA
|
| |
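The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the snippet below strips that spacing, computes a GC fraction, and translates a CDS codon by codon. It assumes the standard codon assignments, which translation table 11 shares with the standard code for non-initiator codons; the table differs mainly in which codons may act as start codons, and that is not modeled here. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean_sequence("ATGACTCGCA ACGAAGAACG TACCCGGAAC")
print(translate(prefix))  # MTRNEERTRN
print(f"GC fraction of prefix: {gc_fraction(prefix):.2f}")
```

The translated prefix matches the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would be one way to compare against the listed GC content.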