Gene Rcas_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1960 
Symbol 
ID5539438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2509708 
End bp2511285 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content64% 
IMG OID640894095 
Productmajor facilitator transporter 
Protein accessionYP_001432066 
Protein GI156741937 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGCTCG AACAGAAACC AGCAACGATC AGCGCATCAC AGCCGGTCCA GGAACGCGGC 
ATCTCCGCCT GGCTTCAGCA GCCACAGGCG GCGCTGGCTC TGGTCTGCGC CGCTGTATTC
GTTGGCGCTG TCGATCTGAC AATTGTGACA GCGGTTCTGC CGAAGATCAT GGTCGATCTG
AGCGTCTCAA TCGAAACCGA ACTGCACCGC GCTTCGTGGA TCATTACCGG GTATCTGCTT
GCCTACACCA TCAGCATGAC CTTTACCGGT CGCCTGTCTG ACCTGTACGG GCGCCGCGTC
GCCTACATGA TCTGCCTGAC CATTTTTACC ATTGGCTCGA TTGTGGTTGC GGTCGCGCCG
GCGCTCGAAG AGGTGGTCCT GGGACGGGTG GTGCAGGCGC TCGGCGCGGG AGCGCTGGTG
CCGATCTCGA TGGCGCTGGT GAGCGATCTC TTCCCGCCGG AGCGGCGTCC GGCGGCACTG
GGGGTGATCG CCGCCGTCGA CACTGCCGGA TGGATGGTGG GACATGTCTA CGGCGGCGCG
CTCATGCGGT TATTCGATGA CTGGCGGCTG CTGTTCTGGC TCAATGTGCC ATTTGGTATC
ATTGCTCTGG CGTTGACCTG GCTGGCACTA CGGCGGCTGG TCATCACCCG CGCAGCAGGC
AGTTTCGATT GGCGCGGTGC GGTGTTGATC TCGCTCAGCC TGACAGCATT CAACATCGGC
ATGGGCGCCG GCGCCGAGTT GGGGCAGACC GATTTTTATG GCGACCGTCC CGGACCACCT
CCATATGCGT TGCCCTTGAC CCTGGCGTCG CTGGTTGTGC TGGCGGCGTT CATCTGGGTC
GAGCGGCGGG CGCGCGACCC GCTGCTCGAT CTGACGCTCT TCCGTCAGCG CGGAACGGTT
GCGGCATCGA TCATGAATGT ATTGATCGGG TTCGTGCTGG CGCTGGCGAT TGCGAATGTG
CCACTGTTCA TCAATACCCG CCTGGGGCTA CTCTACCCGA CCGACCCGGA CATCCTGCGG
CGAGGCGCGT GGGAGACTGG TCTGGTGCTG TCGGCGCTGA CCCTGGCGCT GGCGCTGCTG
GCCTGGCCTG GTGGTCGCCT GGCGGGACGA TTCGGCGAAC GCCTTCCGGC GCTGATCGGG
CTGGCAGTGG CAACCGCAGG ATACCTGGCA ATGAGCCGCT GGCAATCGGA CACTGATTAC
GGGACGATGG TCGGCGGGCT GACACTGGCA GGCTGCGGCA TCGGTATGGC GCTTGCACCC
ACGGCTTCGG CAATTATCAC GGCTGCCGGA CCCAACCGAC GCGGCGCGGC GTCGGCGCTG
GTCATCATCC TGCGACTGAT CGGCATGACC GCTGGCGTCT CAACGCTGAC GTTGTGGGGT
GTGCAGCGCC AGGATGCGCT GCGCCGCACA GCAGACCCGG CGCTGCTGGC AGACTTTGAT
CAGGTACGGA TGTTTTTGAT CGATGTGGCG GCGCAGGTCG TGGGTGAAAC ATTCCTCTTC
GCCGTCGCAG CGTGCGCGCT GGCGCTCATA GCCGCAATCT GGCTGCCGGG GCGCGCCGTC
AGGGAAGCGT CGGGGTAA
 
Protein sequence
MTLEQKPATI SASQPVQERG ISAWLQQPQA ALALVCAAVF VGAVDLTIVT AVLPKIMVDL 
SVSIETELHR ASWIITGYLL AYTISMTFTG RLSDLYGRRV AYMICLTIFT IGSIVVAVAP
ALEEVVLGRV VQALGAGALV PISMALVSDL FPPERRPAAL GVIAAVDTAG WMVGHVYGGA
LMRLFDDWRL LFWLNVPFGI IALALTWLAL RRLVITRAAG SFDWRGAVLI SLSLTAFNIG
MGAGAELGQT DFYGDRPGPP PYALPLTLAS LVVLAAFIWV ERRARDPLLD LTLFRQRGTV
AASIMNVLIG FVLALAIANV PLFINTRLGL LYPTDPDILR RGAWETGLVL SALTLALALL
AWPGGRLAGR FGERLPALIG LAVATAGYLA MSRWQSDTDY GTMVGGLTLA GCGIGMALAP
TASAIITAAG PNRRGAASAL VIILRLIGMT AGVSTLTLWG VQRQDALRRT ADPALLADFD
QVRMFLIDVA AQVVGETFLF AVAACALALI AAIWLPGRAV REASG