Gene Rcas_2748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2748 
Symbol 
ID5540234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3556602 
End bp3557897 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content62% 
IMG OID640894874 
Productmajor facilitator transporter 
Protein accessionYP_001432837 
Protein GI156742708 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC CGACTGCTAC GATGACCGCT CAGGCAGAAG CGGTTGTCCG GCGCCATGCT 
GCGCGCAATT TCTGGCTCAA CGTCCTCGAC GGCGGAACAT TCTACCTGGG CTTGAGCATG
GTATCGCGCT TCACCGTGCT CCCGCTGTTC GTCGAGCGCC TTTCACCGGA TCGCTGGTTG
CAAGGGCTGA TTCCGACCAT CAACTACACC GGATGGTTCT TGCCCGGACT GTTTATTGTG
CCGCTGATCG CCGCGATGCC GCGCCGCAAA CCGATTCTAC TGACGGCAAC ACTCTTCGAG
CGTTTGCCAT TCCTGCTTTT GGGGCTTGCA CTCATTCTCT GGCCCGACCT ACCGGGCGCC
GGTCTGCTGG CGATCTTCTT CTGTTGCTAC GCTATCCATG CCATCGCTGC CGGATTTGCT
TCTATCCCCT GGCAAGATTT CATTGCGCGC GTCATTCCAG GCACACGGTG GGGCATCTTT
TTCGGTTTGC AGAGCGGACT TGGCGGCTTG CTCGGCATCG GCGGCGCGGC AGTTGCTGCC
TGGGTGCTGA CGGCATTTCC CTTTCCGCAG AGCGTCGGCA TTCTATCACT CATCTGCTTT
GGGTCGATGG TCGTGTCATT CATCTTCCTG GCGGCGACGG TCGAGCCGGC GCTGCCACCG
CAGCCAGCGC AACCGCTGAC CGCGTTCCTG CGCGGCGTGC GACCACTGCT GGAGCGCGAC
GCACCGTTTC GCAACTATCT GATCGGGAGG GTCGGCATTG CGCTGGCGCT CATCGGTCAT
AACTTCATCA CAGCCCTTAG TCTCGAACGC TTCAACCTGT CTGGCGCTGA TGTCGGCGGT
TTTACAGCGG CGCTGCTGGC GGCGCAGGCG GTGAGCGACC CGATCCTCGG CGGGATGGCG
GATCGCTGGG GACACAAGCA GGTGCTGGAA CTTTCGACGG CGGTAGGATT GGCGGCGATT
CTGCTGGCGC TGATTGCACC ATCGCCCGCG TGGTTCTTTG TCATTTTTGT GCTGGTCGGC
TTTTCACAGG CGGGGTATAT TCTCTCCGGC TTTACGCTGG TGTTCAGTTT TGCGCCGCCG
GCGCAGCGCC CTGCCTATAT TGGCGTGTCG AACATCGTGA TGGCGCCGAT CGCCGCCGCA
GGACCGCTCC TCTCCGGTTG GCTCGCCGAA CTCGCAAGTT ACGAGGCGCT CTTCGTTGTG
CTGATGGCGG TTGGCATCGT CAGTCTGGGG TGGATGCGCA TGCGTGTGCC GCGCCCGGCA
GTGAAGGCAG CGCTGGCGGA GGAGCGCGTG GGGTAG
 
Protein sequence
MTDPTATMTA QAEAVVRRHA ARNFWLNVLD GGTFYLGLSM VSRFTVLPLF VERLSPDRWL 
QGLIPTINYT GWFLPGLFIV PLIAAMPRRK PILLTATLFE RLPFLLLGLA LILWPDLPGA
GLLAIFFCCY AIHAIAAGFA SIPWQDFIAR VIPGTRWGIF FGLQSGLGGL LGIGGAAVAA
WVLTAFPFPQ SVGILSLICF GSMVVSFIFL AATVEPALPP QPAQPLTAFL RGVRPLLERD
APFRNYLIGR VGIALALIGH NFITALSLER FNLSGADVGG FTAALLAAQA VSDPILGGMA
DRWGHKQVLE LSTAVGLAAI LLALIAPSPA WFFVIFVLVG FSQAGYILSG FTLVFSFAPP
AQRPAYIGVS NIVMAPIAAA GPLLSGWLAE LASYEALFVV LMAVGIVSLG WMRMRVPRPA
VKAALAEERV G