Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2058 |
Symbol | |
ID | 5539538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2639603 |
End bp | 2641018 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894194 |
Product | major facilitator transporter |
Protein accession | YP_001432163 |
Protein GI | 156742034 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.469976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.844529 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAC GCAAACGCAA CCAGATTTTG CTCGTTCTGT TTATCGGCGT CCTGATGGCG GCGCTCGACA TTGCTATCGT CGGGCCGGCG CTTCCGGCGT TGCGCGAGCA CTTTCAGATC GATGCGCGCG CGGCATCGTG GATGTTCGTT ATCTACGTGC TCCTCAATCT TGTCGGAACT CCGTTTATTG CCAAATTATC GGATCGATTC GGTCGGCGTA TCCTGTATAC GGCAAGCATT GCCCTTTTTG GTCTGGGGTC ATTGATTGTC GTCGCGGCGC CGGTGTATGC GGTGGTGCTG GCAGGCCGGG CTATCCAGGG GCTTGGAGCC GGCGGCATCT TCCCGGTCGC CAGCGCGGTG ATCGGCGATA CTTTTCCGCC GGAGAAACGT GGGAGCGCGC TTGGTCTGAT CGGCGCCGTG TTTGGTATTG CGTTCCTGGT GGGACCGATT ATCGGCGGAT TATTGCTGTT GCTCGGATGG CAGTGGCTGT TCCTGATCAA TCTGCCGATT GCAATGGCGC TGATCGGGTT TGGCGTGAAA TTGTTGCCTG CCATACGCAC AGCGACACCG CGCCCCTTCG ATTGGGGTGG GACGGTTGTG TTGGGTGTGC TGCTGGCGGC GCTGGCTGTG GCGCTCAGTG ATCTTGCCTA TCTGCTGGAA GACGCCAGTG TGTCCGGTCT GGTCAATGCG TTGCGCACAT CCACGGTGGC GCCGCTGTTG TTGCTGGCGC TGGCGCTGGC GCCGGTCTTC TGGTGGATCG AGCGCCGCGC TGAAGACCCG GTGTTGGATC TGAACCTGTT CCGCAACCGG CAGATTGCGC TGGCCGGTGC GCTCTCGTTT GGCGCAGGGT TGAGCGAGGC GGTGACGCTG TTCGTGCCAT CGTTGCTGGT GGCGGCGTTT GGCGTCACAC CATCGACGGC GAGTTTTATG CTGATCCCGA TGGTGCTGGC AATGGCGATT GGTTCGCCGT TGTCGGGGCG CGCGCTGGAT CGGGTTGGTT CAAAGATCGT TGTGCTCACC GGCACCGCCT TGATTGCCGC CGGGTTGGTG CTGGAAGGCA CGCTGGCGAA CGTTTTGCCG GCATTTTATG GATTCTCTGC ACTGTTCGGC ATTGGCATCG GTGTCTTGCT CGGCGCGTCG CTGCGGTATA TTCTGTTGAA CGAAGCGCCG GCCGAAGAGC GAGGCGCAAC CCAGGGAGTG CTGACGGTGT TCATCAGTAT CGGGCAGTTG ATTGGCGCCG TTTTGCTGGG CGCCATAGCA GCCGCACGCG GCGGCGATGT CGGTGGCTAT GCTTTTGCCT TCCTCGTGGT CGGCGTCCTG ATGGCAGTGT TGTTCATCGC ATCGTTTGGG TTGAAGAGTC GCGCCCAGGA ATTGGCGACG CAGCAGCGTT TGCAGCGCGG CGCATCGATG GCATGA
|
Protein sequence | MDERKRNQIL LVLFIGVLMA ALDIAIVGPA LPALREHFQI DARAASWMFV IYVLLNLVGT PFIAKLSDRF GRRILYTASI ALFGLGSLIV VAAPVYAVVL AGRAIQGLGA GGIFPVASAV IGDTFPPEKR GSALGLIGAV FGIAFLVGPI IGGLLLLLGW QWLFLINLPI AMALIGFGVK LLPAIRTATP RPFDWGGTVV LGVLLAALAV ALSDLAYLLE DASVSGLVNA LRTSTVAPLL LLALALAPVF WWIERRAEDP VLDLNLFRNR QIALAGALSF GAGLSEAVTL FVPSLLVAAF GVTPSTASFM LIPMVLAMAI GSPLSGRALD RVGSKIVVLT GTALIAAGLV LEGTLANVLP AFYGFSALFG IGIGVLLGAS LRYILLNEAP AEERGATQGV LTVFISIGQL IGAVLLGAIA AARGGDVGGY AFAFLVVGVL MAVLFIASFG LKSRAQELAT QQRLQRGASM A
|
| |