Gene RoseRS_3952 details

Gene Information

Locus tag: RoseRS_3952
Symbol:
ID: 5210936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4948381
End bp: 4949622
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 62%
IMG OID: 640597546
Product: major facilitator transporter
Protein accession: YP_001278252
Protein GI: 148658047
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
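
The start, end, gene length and protein length fields above are mutually consistent, which a short Python sketch can check. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4948381, 4949622)
print(length, protein_length(length))  # prints: 1242 413
```

Both values match the reported gene length (1242 bp) and protein length (413 aa).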


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTC CATCGTTGCA GCGCAGCACA TTTGCCGCAA GGACGTTCGC GGCGTTACGC 
TACCGTAACT ATCGTCTCTG GTTCTTTGGT CAGATGGTGT CGCTCTTCGG CTCCTGGATG
CAGACCACTG CCCAGGGATT CCTGGTGTTC CAGTTGACCG GTTCGCCGGC GTACCTGGGA
TACGTCGGCT TTGCTGCTGG CGTGCCCGCC TGGGCGTTGA CCCTGTATGG CGGCGTGGTT
GCCGACCGTG TGCCGCGGCG GACGCTCCTG ATTGTGACGC AGATAGCGCA GATGGCGCTG
GCGTTCATTC TGGCAGGGCT GGTCTTCAGC GGCATTGTTC AACCCTGGCA CATTATCGTT
CTGTCGCTGT TGCTGGGGGT CGCCAACGCC TTCGACGCCC CGGCGCGCCT GGCGTTCGTG
CGCGAACTGG TCGATAAGGA GGATCTGACG AACGGTATTG CACTCAATGC GACAATGTTC
AACCTGGCGA CCACGACCGG ACCGGCAATG GCGGGCGTCA TGTACAACCT GGTCGGACCG
GCGTGGTGTT TCATGCTCAA CGGTATCTCG TTTCTGGCGG TCATCGGCGC ACTCTGGCGG
ATGCGAATCG CGTCTGGTCC GGTTGCGCCG CGCAATGCCT CCGCGTGGCG CGATCTCCGG
GAGGGTCTGA GTTACATCCT GCACGAGCCG GTGGTGCGGA CGCTGATCGT GCTGGTTGGG
GCGACGAGTT GCTTCGGTAT TTCGTTTGCG ACCCTCTTCC CGGCGTGGGC GGTGCGCATT
TTAGGCGGCG ACGCAACAAC GACCGGGCTG TTGCAATCGG CGCGCGGTCT GGGCGCGCTG
ATGGGCGCGT TGCTGATTGC ATCGCTTGGT CGCTTTCAGT TCAAAGGGCG CCTGCTCACG
GTCGGCACGT TCGCCTTTCC ACTCCTGCTG ATCCTGATGA CCTTCACCAA CCGGTTGTGG
CTGACTCTGG CGATCCTGGT CGCTTCCGGT CTGGCGGTCA TCCTGATCAT GAATCTGGCG
AATGCGCTGG TGCAGACGTT GACCCCTGAT GCGTTGCGCG GTCGGGTGAT GGCGGTGTAC
AGCATGGTTT TCTTCGGCAT GATGCCGGTA GGGGCGTTGT GGGTCGGGGT GGTTGCCGAG
CGGGTGGGAG AACCGGCGGC AGTGATCAGT GGGGCGCTGG CGGTGCTGGG TGTCGCAGGA
GCGATTTTTG TCGCCGTGCC GCAAATTCGG GCGTTGCGGT GA
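
The reported GC content can be recomputed from the gene sequence above. The following is a minimal Python sketch, assuming the sequence is pasted as printed, in 10-base blocks separated by spaces and newlines. The name gc_percent is illustrative.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 1 G and 3 C out of 10 bases.
print(gc_percent("ATGACCATTC"))  # prints: 40.0
```

Passing the full 1242-base sequence to gc_percent gives a value that can be compared with the reported 62%.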
 
Protein sequence
MTIPSLQRST FAARTFAALR YRNYRLWFFG QMVSLFGSWM QTTAQGFLVF QLTGSPAYLG 
YVGFAAGVPA WALTLYGGVV ADRVPRRTLL IVTQIAQMAL AFILAGLVFS GIVQPWHIIV
LSLLLGVANA FDAPARLAFV RELVDKEDLT NGIALNATMF NLATTTGPAM AGVMYNLVGP
AWCFMLNGIS FLAVIGALWR MRIASGPVAP RNASAWRDLR EGLSYILHEP VVRTLIVLVG
ATSCFGISFA TLFPAWAVRI LGGDATTTGL LQSARGLGAL MGALLIASLG RFQFKGRLLT
VGTFAFPLLL ILMTFTNRLW LTLAILVASG LAVILIMNLA NALVQTLTPD ALRGRVMAVY
SMVFFGMMPV GALWVGVVAE RVGEPAAVIS GALAVLGVAG AIFVAVPQIR ALR
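
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration in Python, not the database's own pipeline. It builds the codon table from the NCBI ordering of the standard code, on the assumption that translation table 11 assigns the same amino acids and differs only in its permitted start codons, which the sketch does not model. All names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCATTC CATCGTTGCA GCGCAGCACA"))  # prints: MTIPSLQRST
```

The output matches the first ten residues of the protein sequence, and the final four codons of the gene (GCG TTG CGG TGA) translate to ALR followed by a stop, matching its last three residues.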