Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2789 |
Symbol | |
ID | 3836229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 3223449 |
End bp | 3225155 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637826900 |
Product | major facilitator superfamily protein MFS_1 |
Protein accession | YP_427873 |
Protein GI | 83594121 |
COG category | [G] Carbohydrate transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGT CTTCCGCCGG TGCCGCCGCC GGCGCCCCCT CGCTGTTTCG CCAGCCCAAG GCGGTGTGGG CGACCGCCTT CGCCGCGGTG ATCGGTTTCA CCAGCATCGG ACTGGTCGAC CCGATCCTGA CCTCGATCGC CGAGGGGCTG AGCGCCACGC CCAGTCAGGT CTCGCTGTTG TTCACCAGCT ATTTCTTCGT CACCGCCGTG ATGATGCTGG TCACCGGCTT CGTCTCCAGC CGCATCGGCG GGCGCAACAC CTTATTGCTT GGCGCCCTGC TAATCGCCGT CTTCGCCGCC CTGGCCGGCA CCTCCGACTC GGTGGCCGAA CTGGTCGCTT ATCGGGCGGG ATGGGGCTTG GGCAACGCCT TTTTCGTCGT CACCGCGCTG TCGGTGATCG TCGGCGCCGC TTCGGGCGGC ACGGCCGGGG CGATCTTGCT TTATGAGGCC GCCTTGGGCT TGGGCATCTC GGCCGGACCG TTGATCGGCG CGGCGCTCGG CGCCCATTCC TGGCGCTATC CGTTCTTCGG CACGGCGGCG CTGATGACCA TCGGCTTCCT GGCCATCGCC GTGTTCCTTG ATCCCCAGCC CAAGCCGGCG CGCAAGATCG GCTTGTCGGG GCCTGTGAAG GCCTTGCGCC ATCCCGGCTT GCTGACGACC TCGGTCAGCG CCTTTTTCTA TTACTACGCC TTCTTCACCG TGCTGGCCTT CGCGCCCTTC GTGCTGCGAC TGTCGGCCCA TGCCATCGGC CTGATCTTCT TTGGCTGGGG GGTGGCGCTG GCCCTGTTCT CGGTGCTGGT CGCCCCCCGT TTGCAGGCGC GCTTTGGCGC CTTCGCCCTG CTTGCCGTCA GTCTGGTCGG CTTCGCCCTG CTGCTCTGCG TCATGGCCTT CGGCACGATC CCGATGATCG TGACCGCCGT CGTCGCCTCG GGGGCGCTGA TGGGCGTCAA TAACACCGTC TATACGGAAA TGGCGCTGGA GGTTTCCGAG CAGCCGCGCC CGGTGGCTTC GGCGGCCTAT AATTTCCTGC GCTGGTTCGC CGGCGTCATC GCCCCTTATG CCGCCTCGCG CCTGGGCGAG AGCTCCGGTC CGGCCAGCGC CTTCCTGACG GCGGCGGGCG CCGCGCTGAT CGGTGGGGCG ATCCTGGTGG CGCTGCGGCG CAATTTGGGC CGCTACGGCC AGACGCGCCA GGACGCAATC CCGCCGACCG TCCCGTCGAT CGGGCCGATT CTGGTCGGGC TTGATGGTTC GGCCGCCGAT CGGGCGGTGC TGGCGCGGGC GGTTACCCTG GCGCGCCAGG GCGGCGGGGC GGTGTTCGTG CTGCACATTC GCCCGATCGA GGTGTTTGGC GAATTCGCCG CCGCCCTGGA AGACCGCGCC GCCGGCCGGG CGGTTGTCGA GAACGCCGTG GCAAGCCTCG GCGCCCAGGG GATCACCGCC ATGGGCGAGG TGCTCGAGGA ATCGTCCACC CTGGTCCCCC AGCGGGTCAT CGCCCGCGCC CGGGCCCTGG CCGCCCGGGT GATCGTGCTG GGCACCCGCC ATCCCGGCGA TCTTGGCAAT CTGTTGCACG GCTCGGTGGC CGATATCGTC GGCCGCGAGG CCGGACGGCT GGTCGAACTG GTTCCGAGTT CGGCGGGGGA AGGCGAGGCC GGGGAGACTG CCTTAGAACC CCGGGCGCTC GCCGCCACGC CAGGACACCG GGTTTAG
|
Protein sequence | MSSSSAGAAA GAPSLFRQPK AVWATAFAAV IGFTSIGLVD PILTSIAEGL SATPSQVSLL FTSYFFVTAV MMLVTGFVSS RIGGRNTLLL GALLIAVFAA LAGTSDSVAE LVAYRAGWGL GNAFFVVTAL SVIVGAASGG TAGAILLYEA ALGLGISAGP LIGAALGAHS WRYPFFGTAA LMTIGFLAIA VFLDPQPKPA RKIGLSGPVK ALRHPGLLTT SVSAFFYYYA FFTVLAFAPF VLRLSAHAIG LIFFGWGVAL ALFSVLVAPR LQARFGAFAL LAVSLVGFAL LLCVMAFGTI PMIVTAVVAS GALMGVNNTV YTEMALEVSE QPRPVASAAY NFLRWFAGVI APYAASRLGE SSGPASAFLT AAGAALIGGA ILVALRRNLG RYGQTRQDAI PPTVPSIGPI LVGLDGSAAD RAVLARAVTL ARQGGGAVFV LHIRPIEVFG EFAAALEDRA AGRAVVENAV ASLGAQGITA MGEVLEESST LVPQRVIARA RALAARVIVL GTRHPGDLGN LLHGSVADIV GREAGRLVEL VPSSAGEGEA GETALEPRAL AATPGHRV
|
| |