Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1572 |
Symbol | |
ID | 5208527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1921592 |
End bp | 1922839 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640595178 |
Product | major facilitator transporter |
Protein accession | YP_001275914 |
Protein GI | 148655709 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.58248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAT CCACGTCTTC GATGAACCCC TCCCGTTTCG TGGCGCTGCG CTATCGCGAC TTCCGGCTCC TCTGGCTTGG TCAGTTTGTG TCGATCACCG GCACGCAGAT GCGCAACGTG GCTATCGCCT GGCAGATCTA TCAACTGGCG CGTGTTGACA GCAGCATTCA GCCTGAACTC GCACTCGGTC TGATCGGTCT GGCGCGCGTC GTCCCACTGG TTGTCACGGC GCTCGTGAGC GGTATGGTCG CAGATCGCGT CGAACGACGC AGCATGCTGA TCCTGACTTC GCTGGCAGCG CTTGGGTGCT CAGTTGTACT CGCGTTCGCC GCCGAAACGG AGCGCCCGCC GCTGGCGCTG ATCTACGCGA TGGTGGCGCT GGCATCGGTG GCAGGCGCGT TCGAGTTGCC AGCGCGTCAG GCGATTATTC CCAATCTCGT GGCGCCGCAG CATCTGCCCA ATGCCCTGAG CCTGAATATC GTCGCCTGGC AACTTGCGAC CGTGATCGGT CCGGCGTTGT CCGGCATCCT GATCGCGGCG GTTGGGGTTG CACCGGTGTA CTGGATCGAT GCTGCCAGTT TTCTCGCAGT GGTTGTTGCA GCGCTACTTA TGCGCACGCG CAATCTCCCC GCGCGCGCTG AACCGGTCTC GTTGCAGGCG GCGCTGGCAG GGTTGCGCTT CGTCTTTTCG CATCGCCTGA TTGCAGCAAC GATGCTGCTT GATTTCTTTG CCACGTTCTT TGGCGCTACC GGTGTGTTGC TGCCGGTCTT CGCCGATCAG GTGCTGCGGG TTGGTCCAAC CGAACTGGGC TGGATGTATG CAGCGCCATC GGTCGGCGCG GTGATCGCCG CCACGCTGCT GAGCGGCGTG CGCATCCCGC GTCAGGGGAT GACGCTCCTG GCGGCAGTGC TGCTCTTTGG CGTATGCGTC GTGATCATCG GCGTGTCGCG CTGGCTGCCG TTGACACTGC TGGCGCTGGC AGGCATGGGC GCGGCGGATA CGGTCAGCAT GGTGATCCGC GGCACGATCC GTCAGTTGCT GACCCCCGAT GAGTTGCGCG GAAGAATGGT GGCGGTCACG ATGATCTTTT TTGCTGGCGG TCCGCAACTG GGTGAAACCA ATGCCGGGTT TATCGCCAGT CTCATCGGCG CGCCTGCAGC AGTGGCGATC GATGGTGCGG CGTGTATCGT CATAGTGATC GGGACGGCGC TCAAGGTTCG TGAGTTGCGC CAGTATGACG GTTCGTGA
|
Protein sequence | MNPSTSSMNP SRFVALRYRD FRLLWLGQFV SITGTQMRNV AIAWQIYQLA RVDSSIQPEL ALGLIGLARV VPLVVTALVS GMVADRVERR SMLILTSLAA LGCSVVLAFA AETERPPLAL IYAMVALASV AGAFELPARQ AIIPNLVAPQ HLPNALSLNI VAWQLATVIG PALSGILIAA VGVAPVYWID AASFLAVVVA ALLMRTRNLP ARAEPVSLQA ALAGLRFVFS HRLIAATMLL DFFATFFGAT GVLLPVFADQ VLRVGPTELG WMYAAPSVGA VIAATLLSGV RIPRQGMTLL AAVLLFGVCV VIIGVSRWLP LTLLALAGMG AADTVSMVIR GTIRQLLTPD ELRGRMVAVT MIFFAGGPQL GETNAGFIAS LIGAPAAVAI DGAACIVIVI GTALKVRELR QYDGS
|
| |