Gene RoseRS_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4401 
Symbol 
ID5211386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5518426 
End bp5519670 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content63% 
IMG OID640597981 
Productmajor facilitator transporter 
Protein accessionYP_001278684 
Protein GI148658479 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.868968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACG CACGGAACCC GGTGCTCCTT CCACCTGCAC AGCGACGACG CGGCGCAGTG 
GTGATGCTGG TGATTACTTT CCTCATGTGG GGCGGATTCT TCATGGTTAT CCCCCTTATA
TCGATACGCT ACGTTGATGA CCTGGGATGG TCGGCGGGAG CGATAGGACT GGTGCTGGCG
ATCCGTCAGT TAACGCAGCA GGGATTGACC GTTTTCGGCG GCGCGCTGGC AGACCGGTTC
GGCGCGAAGG GATTGATCGT CGTCGGCATG TTCATTCGTG CGGTCAGTTT CAGTGCGCTG
GCGCTTGCGT CAACCTATCC GCTGCTGATG ATCAGCGCGC TCATGGCGGC GATTGGCGGC
GCATTGTTCG ATTCGCCATC GTCGGCGGCG ATGGTTGCGC TTACGCGACC GGAAGAGCGC
AACCGGTACT TTGCAGTGCT GGGAGTCGTG CGCAACCTGG GGATGTCCCT CGGTCCGCTG
GCGGGCGCAG TGTTGCTACG GATCGATTTT GCCTTTGTTG CGCTTGCGGC GGCTGGCTGT
TTCTTCATCG CTGCGGCTGT GACGTTGCTG CTCTTGCCGC CGGTGCAGGT TGCAACCGAA
CGCGGTGAAC TGCTGGCGGG CATTCTGCTT GCGCTGCGCG ACCGGCGCTT CATGGCGTTC
AATGTGCTGC TGATGGGATA CTGGTTCATG TGGGTGCAGA TGACCATCTC GCTTCCCCTG
GCGGCGCGCA CGCTTGCCGG GACTGCTGAT GCCGTGAGCT GGCTCTACGC CCTGAATGCA
GGAATGGGCA TCGTGTTGCA GTATCCGGTG GTGCGCATCG CCGAACGCTG GTTGCGCCCG
CTCCCGGTGT TGCTGATCGG CATTGCGCTG ATGGCATTGG GGTTGGGCAG CGTGGCGCTT
GCCAGTACGA CCGGGCTGCT GCTGGCGAGT GTGGCGATCT TTTCATTTGG CGCCTTGCTG
GCTGCGCCGG GACAACAGAC GGTCGCTGCC GAACTGGCGA ATCCGACGGC GCTTGGCTCG
TACTTCGGCG TCAGCGCACT GGCGCTGGCG CTGGGCGGCG GGATCGGGAA TTATGCCGGG
GGGGCGTTGT ACAGCCTGGG ATACCACATT GGCGCACCAG CACTCCCCTG GCTGGTCTGT
CTGGTGGTCG GCATCGGCTC GGCAATCGGT CTGGCGCTGC TCGATCGCCA CCTTACCCGT
CATCCGGCGA ATGTTGCCGA TGCAGCCGTG TCCTATCGCG ACTGA
 
Protein sequence
MTDARNPVLL PPAQRRRGAV VMLVITFLMW GGFFMVIPLI SIRYVDDLGW SAGAIGLVLA 
IRQLTQQGLT VFGGALADRF GAKGLIVVGM FIRAVSFSAL ALASTYPLLM ISALMAAIGG
ALFDSPSSAA MVALTRPEER NRYFAVLGVV RNLGMSLGPL AGAVLLRIDF AFVALAAAGC
FFIAAAVTLL LLPPVQVATE RGELLAGILL ALRDRRFMAF NVLLMGYWFM WVQMTISLPL
AARTLAGTAD AVSWLYALNA GMGIVLQYPV VRIAERWLRP LPVLLIGIAL MALGLGSVAL
ASTTGLLLAS VAIFSFGALL AAPGQQTVAA ELANPTALGS YFGVSALALA LGGGIGNYAG
GALYSLGYHI GAPALPWLVC LVVGIGSAIG LALLDRHLTR HPANVADAAV SYRD