Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0040 |
Symbol | |
ID | 4895081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 49785 |
End bp | 51023 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640110616 |
Product | major facilitator transporter |
Protein accession | YP_001041932 |
Protein GI | 126460818 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAT CTCCGATCTT CACGCCCGTC CTGATCTCGG GCTGCATCGT CCTGATGCTG GGCTTTGCGA TCCGCGCCAG CTTCGGCGTG TTCCAGATCC CCATCGCCGA GGAGTTCGAC TGGCCGCGGT CCGACTTCTC GATGGCCATC GCGATCCAGA ACCTCGCCTG GGGCATCGGC CAGCCGATCT TCGGGATGCT GGCCGAGAAG TTCGGCGACC GCCGGGCCAT CGTCGCGGGC GCGCTCACCT ATGCGGCGGG TCTCGTGCTC TCGAGCTTCG CCGTGACGCC GCTCCAGCAT CAGTTCCTCG AGGTGCTGGT GGGGTTCGGG ATCGCGGGCA CGGGCTTCGG CGTGATCCTT GCGGTGGTGG GGCGGGCCAC GGCGCCTGAG CATCGCTCGC TGGCGCTCGG CATCGCCACG GCTGCGGGGT CGGCGGGTCA GGTCTTCGGG GCGCCCGCGG CCGAGATCCT GCTGGGCTTC TACAGCTGGC AGACAGTGTT CGTGATCTTC GCGGGCGTCA TCCTTGCCGC GCTCTTTGCG CTGCCCTTCA TGCGTGCGCC GGTCACCGCG ACGAAGGCCG AGCTCGAGGA GTCGCTCGGC ACGGTGCTCA GACGGGCCTT CCGCGATCCG TCCTATACGC TGATCTTCGT GGGCTTCTTC TCCTGCGGCT ATCAGCTGGC CTTCATCACC GCGCACTTCC CCGCCTTCGT GACGGAGATG TGCGGGGCGA TCGATCCGCG CGGGCCGCTG GCGGCGCTGG GGATCACCAC CACCTCGGCG CTGGGCGCAC TGGCGATCTC GCTGATCGGG CTGGCCAACA TCGCGGGCAC GATCACCGCA GGCTGGCTCG GCAAGCGCTA CTCGAAGAAA TACCTGCTGG CCGCGATCTA TACCGGGCGC ACGCTTGCGG CCGCGCTCTT CATCCTCGTG CCGATGACGC CCACCACGGT CCTTCTCTTC TCCCTCAGCA TGGGCGCGCT GTGGCTGGCG ACCGTGCCGC TCACGAGCGG GCTCGTGGCC CATCTCTACG GCCTGCGCTA CATGGGCACG CTCTACGGGT TCGTCTTCCT CAGCCATCAG CTCGGCAGCT TCATGGGCGT CTGGCTGGGC GGGCGGATGT ATGACATGAC CGGCGACTAT ACGATGGTCT GGTGGATCGG CGTGGGCGTC GGCGCCTTCT CGGCCATCGT CCACCTGCCC ATCCGCGAGA CCCGCAGCCC CGCGTTGCAG CCGGCCTGA
|
Protein sequence | MTKSPIFTPV LISGCIVLML GFAIRASFGV FQIPIAEEFD WPRSDFSMAI AIQNLAWGIG QPIFGMLAEK FGDRRAIVAG ALTYAAGLVL SSFAVTPLQH QFLEVLVGFG IAGTGFGVIL AVVGRATAPE HRSLALGIAT AAGSAGQVFG APAAEILLGF YSWQTVFVIF AGVILAALFA LPFMRAPVTA TKAELEESLG TVLRRAFRDP SYTLIFVGFF SCGYQLAFIT AHFPAFVTEM CGAIDPRGPL AALGITTTSA LGALAISLIG LANIAGTITA GWLGKRYSKK YLLAAIYTGR TLAAALFILV PMTPTTVLLF SLSMGALWLA TVPLTSGLVA HLYGLRYMGT LYGFVFLSHQ LGSFMGVWLG GRMYDMTGDY TMVWWIGVGV GAFSAIVHLP IRETRSPALQ PA
|
| |