Gene Rcas_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1026 
Symbol 
ID5538492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1338035 
End bp1339258 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content61% 
IMG OID640893165 
Productmajor facilitator transporter 
Protein accessionYP_001431148 
Protein GI156741019 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGTCGT TTTCTCACCT TTTCTTGCAG CGCCTCGAAC GGTTCGTGAT CGGTTCGTAC 
CCGGAATCCG TTCGCGCCAA TGTTCGCATC GAATTGTTCG CCTCATTCTG GTATGGTCTC
TTTTTTGCCG CCAGCCTGAC GTTTTTTCCG GTCATTCTGC GCCGCTTGGG AGCGACCCCC
TTCGATCTTG CGCTGTATGT CATCTTCAGT TATATCGGTC AAACCCTATC GCCCTTGAGT
CTAGCGCTCC TCCAACGCGC GTCGCCGTTG TGTTTTTCGA TCATCGCCTG GTCGTTTGGG
CGCGGGTTGC TCCTGTGTGG CGCGCTGCTC ACCGGAGCGC CCTGGCTGCT GGCGCTTGCG
GCGTTGTTCT GGATCGCCGA GGCGCTGCCG GCGCCAGCCT ATGCGCGCAT CATGCAACAG
ATTTATCCGC CGCGCTACCG GGGGCGGGCA ATGTCTGCCG TGCGAATCGG GGTCGCCATC
GTGGTGCTGG TTGCTACGCC GGTCGCAGGC TGGGCGCTCG ATCAGGTCGG GCATCAGCCG
CTCTTCGCGC TGGCTGCAAT CTTTGGCGTG GTTTCGAGCC TGATTTTCTC ACGGGTTCGC
CCACTCGATG GCGTTGCGGA ACCGGAGAGT CCGCCGACAC TGCGGGAGTT GCTCCCGATC
GTGCGCCGCG ATCGGCGATT TATGCTGTAT CTGATCGTCC TGGTCGTTTA TGGGTTTGGA
GCAGTGATGG CGCTGCCGCT CTACCCGCTG GTACAGGTAA GCCGGCTGCA ACTCTCGTAC
ACCACGATTG GTTATCTGAA TCTGGTGCAG TCGATCTTCT GGTTGGCGGG ATTCTTTGTG
TGGGGGCGGC TGCTGGACCG TTATGGACCG CTGTGGGTGT TGCGCTTGAG CATGCTCCTG
GCAGCCTTTG TGCCGTTCAC GTATGTGTGG GCTGGCAATG CCTGGATGCT CCTGCCGGCG
TTTATCTGTC AGGGATTGAT GCAGGGCGGT TTTGAATTAG GGATTACCAC CGGAGTCATC
GATCTCGCCG AACGCGGTCG CGTGATGGAA TACACGGCGC TGCAAGCAGC GATTATCGGC
GTGCGCGGTA TGCTGGCGCC GCTGCTCGGG TCGCTGCTCC TGGGCATTGG CGCTTCCGAA
GCACTGGTGC TCGGCGCCGC CACCGGGTTG GTGATCGCGT CCTGCGTGTT GCTGGGCGCC
GTGCAACCGC CACGGACCAG TTGA
 
Protein sequence
MQSFSHLFLQ RLERFVIGSY PESVRANVRI ELFASFWYGL FFAASLTFFP VILRRLGATP 
FDLALYVIFS YIGQTLSPLS LALLQRASPL CFSIIAWSFG RGLLLCGALL TGAPWLLALA
ALFWIAEALP APAYARIMQQ IYPPRYRGRA MSAVRIGVAI VVLVATPVAG WALDQVGHQP
LFALAAIFGV VSSLIFSRVR PLDGVAEPES PPTLRELLPI VRRDRRFMLY LIVLVVYGFG
AVMALPLYPL VQVSRLQLSY TTIGYLNLVQ SIFWLAGFFV WGRLLDRYGP LWVLRLSMLL
AAFVPFTYVW AGNAWMLLPA FICQGLMQGG FELGITTGVI DLAERGRVME YTALQAAIIG
VRGMLAPLLG SLLLGIGASE ALVLGAATGL VIASCVLLGA VQPPRTS