Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2647 |
Symbol | |
ID | 5540129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3412374 |
End bp | 3413948 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894770 |
Product | major facilitator transporter |
Protein accession | YP_001432737 |
Protein GI | 156742608 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTGC ATCCGGCGTC ATTGCGAACG ATAGTGACGC AATCTCTCCT CGCTCAAGAA ATCGTCTCGC TGCGGATGCG CCTCGCAATG ACAATATGGC AAAGACGCGC CTTCTTTCAT CCAAACTCAT ATGAGCGAGG CTTCGGATTG GTGCGGGCGC GACGCCCACG CTCCCAGTGC TCCCAGGACC GGTGCGGGCG CAACGCCTGC GTGGGTGCGC CGTGCCGCCT GCCACACCAG CGCCGCGTTA TGGATGCATC TTCCGCCTGC AAGATGTGGT ATGATAACCG TTCCTGCACC TTCTCTGAAC CGACACAGAA ACCTATGCAC ACATCAACGC CTTCGCCTCG TTTTGCTGCA CTGACCGCGC TGCGGTACCG CGACTTCCGG CTGCTCTGGG CTGGTCAGTT TGTGTCGATC ACCGGCACGC AGATGCGCAA TGTCGCCATT GCATGGCAGG TGTACCGGCT GGCACAGGCG GACAGCAGCA TTCGGGTTGA AATCGCGCTT GGACTGATCG GTCTGGCGCG CGTTATTCCG CTGATCCTGA CGGCGATGTT CAGCGGTATG ATCGCCGACC GCGTCGAGCG GCGCAAAATT CTGATTCTGA CCTCGCTCGT GGCGTTTGTG TGTTCAATAG TGCTGGCGCT TACCGGCGAG ATGGAGCGCC CGCCGTTGCT GCTCATCTAC ACGATGGTGG CGCTGGCATC GGTCGCCGGC GCCTTCGAAC TGCCGGCGCG TCAGGCAATC ATTCCCAACC TCGTTGCGCC ACAGCACTTG CCAAATGCGT TGAGTCTGAA CATCGTCGCC TGGCAACTTG CGACGGTTAT TGGTCCGGCG TTGTCAGGCG TCTTGATTGC TGCGGTCGGT GTTGCGCCAG TGTACTGGAT CGATGCCGCG ACCTTTCTGG CAGTCGTTGC GGCAGCGTTG CTCATGCGCA CGCGCACCAT TCCGGCGCGC ATCGAACCGG TGTCGCTCCG GGCGGCGCTG GCGGGGCTGC GTTTCGTCTT TTCACATCGC CTGATTGCTG CAACCATGCT GCTTGATTTC TTCGCTACAT TTTTTGGCGC TACTGGAGTG CTGCTACCGA TCTTTGCCGA TCAGGTCTTG CGGGTCGGAC CGACCGAACT GGGCTGGATG TACGCCGCGC CATCGGTGGG AGCAGTGGTC GCTGCAACCC TGCTCAGCGG TGTGCGCATT CCACGACAGG GGACGACGCT GCTCGCGGCT GTGCTGGCGT TTGGCGCATG CGTCGCAGTG ATCGGGATGT CGCGTTGGCT CCCGCTAACG CTGGCAGCGC TGGCAGGCAT GGGTGCAGCG GATACCGTCA GTATGGTCAT TCGCGGCGCG ATCCGTCAAT TGCTCACTCC CGATGAATTA CGCGGGCGCA TGGTGGCGGT CAATATGGTC TTCTTTGCCG GCGGCCCGCA ACTTGGAGAA ACCAGCGCCG GTTTTATCGC CAGCCTGATC GGTGCGTCTG CGGCGGTAAC CCTCGGCGGC GTGGCGTGTA TCCTCCTGGT TGTTGGAACA GCGCTCAGCG TGCGTGAACT GCGCGAGTAC CAGGGACCGG GCTGA
|
Protein sequence | MTLHPASLRT IVTQSLLAQE IVSLRMRLAM TIWQRRAFFH PNSYERGFGL VRARRPRSQC SQDRCGRNAC VGAPCRLPHQ RRVMDASSAC KMWYDNRSCT FSEPTQKPMH TSTPSPRFAA LTALRYRDFR LLWAGQFVSI TGTQMRNVAI AWQVYRLAQA DSSIRVEIAL GLIGLARVIP LILTAMFSGM IADRVERRKI LILTSLVAFV CSIVLALTGE MERPPLLLIY TMVALASVAG AFELPARQAI IPNLVAPQHL PNALSLNIVA WQLATVIGPA LSGVLIAAVG VAPVYWIDAA TFLAVVAAAL LMRTRTIPAR IEPVSLRAAL AGLRFVFSHR LIAATMLLDF FATFFGATGV LLPIFADQVL RVGPTELGWM YAAPSVGAVV AATLLSGVRI PRQGTTLLAA VLAFGACVAV IGMSRWLPLT LAALAGMGAA DTVSMVIRGA IRQLLTPDEL RGRMVAVNMV FFAGGPQLGE TSAGFIASLI GASAAVTLGG VACILLVVGT ALSVRELREY QGPG
|
| |