Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4358 |
Symbol | |
ID | 5541871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5612649 |
End bp | 5613929 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896464 |
Product | major facilitator transporter |
Protein accession | YP_001434400 |
Protein GI | 156744271 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.569485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGTCCC CATATGCCAT TCCGCCATCG CGTCGCCGTC TTCGTATGCC GCATGCGCTG CGCGCCTTGC GCCATCGCAA CTACCGGCTA TTGTTCTTCG GGCAACTCAT CGCGCATATC GGGTTTTGGA TGCAGGCGAC CGCGCAAGGC TGGCTGGTAT TGCGCCTGAC GGACGCGCCA TTCTGGCTTG GCGCCACCGC CGCTGCGCAG TCGTTGCCGG TACTGATCCT GTCTGCGCCT GCCGGCGCTC TGGCGGATCG CATCCCAAAA CGCACCCTGC TGTTGATGAC CCAGGGAACG GCAATGGCGA TGGCGCTGTT GCTGGCGTTG CTGATCTTCA GTGATGTCGT GCAGGTCTGG CATGTCCTCA TCGCTGCCCT GATGGTCGGC ATTGCGTCTG CGTTCGAGAA CCCGGCGCGC CAGGCGTTTA CCATCGAACT CGTTGGGCGA GAGGACCTGA TGAATGCCAT TGCGCTCGAC TCGACCATCA TGAACGGCGC GCGCATTATT GGACCGGCGG CTGCCGGAGC GCTGGTCGCC GTCACCGGCG AAGGTCCGGC GTTTCTGTTC AATGGGCTGA GCGTTCTGGC AGTCATCGGC GGGCTGCTCA TGATGCGACT GCGCCCCTTC GTCGCGCCGC TGCGCCAGAG CCACTGGCAG CAAATGCGCG AAGGGTTTGC CTATATACGC CGCGATGCGC GGGTGCGACT GCTGCTGCTT CAGATCGCCG CCCACTGCGT CTTTGGACTG GCATACTTCC CACTCATGCC CTATTTTGCG CGCAATGTGC TCGGCGCCGA TGCGCAGGGC TTCGGCGTGC TGGCGGCAAC CAACGCCGCC GGGGCGCTGG CTGCCGCACT GATGATCACC CTCGTCGGCG ATCGTCTGCC GCGGGTTGGC GTGCGTTCAG TTGCCTTGCT CAGTTATATG CTGTTGCTTG GTGCGTTTAC CCTGACGCGA TCGTTTGTAC TGGCGATGGC GCTGCTGGCG GCGATTGGAT GGACGGGGAT TATGGTGCTG ACGTTGACGA ATACCCTGCT GCAAATGGCC GTGCCGGACG ACATGCGTGG ACGGGTGATG GGTGTCTATA TGCTCGTCGT GATGGGGGTC AGCCAGGTGA GCGGGCTGTT CCTCAGCAGC GTCGCCGATG TTCTCGGCGA TGTGCCATTG GTCGTCGGAT GTTGGGCGCT GGTCGGCTGG TGCATTCAGG TCTATCTCTT CACGCTGTGG CGACGCGCGC CGGATAATGC TGCGCAGGTC GCGTCGTTGC CGCGCGTGTG A
|
Protein sequence | MSSPYAIPPS RRRLRMPHAL RALRHRNYRL LFFGQLIAHI GFWMQATAQG WLVLRLTDAP FWLGATAAAQ SLPVLILSAP AGALADRIPK RTLLLMTQGT AMAMALLLAL LIFSDVVQVW HVLIAALMVG IASAFENPAR QAFTIELVGR EDLMNAIALD STIMNGARII GPAAAGALVA VTGEGPAFLF NGLSVLAVIG GLLMMRLRPF VAPLRQSHWQ QMREGFAYIR RDARVRLLLL QIAAHCVFGL AYFPLMPYFA RNVLGADAQG FGVLAATNAA GALAAALMIT LVGDRLPRVG VRSVALLSYM LLLGAFTLTR SFVLAMALLA AIGWTGIMVL TLTNTLLQMA VPDDMRGRVM GVYMLVVMGV SQVSGLFLSS VADVLGDVPL VVGCWALVGW CIQVYLFTLW RRAPDNAAQV ASLPRV
|
| |