Gene Rcas_4096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4096 
Symbol 
ID5541607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5310218 
End bp5311468 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID640896208 
Productmajor facilitator transporter 
Protein accessionYP_001434146 
Protein GI156744017 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.25734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.218815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCTCT GCCAGGATAC CGACAAACAT CTCATGAAAC GCAACGCGCT TGCCATTCTT 
TTTCTTGCGG TCTTCGTCGA TCTGGTCGGC TACGGCATGA TCGTGCCGCT GTTGCCGTTC
TATGTGCAGC GCGTCGCTCC CGGCGCCACA CTGGTCGGCA TACTGAGCGG GTTCTATGCT
ATGGCGCAGT TTCTCGTCGG TCCAATGCTG GGGAGTCTGT CGGATCGGTT TGGTCGCCGC
CCGGTGCTGA TTGCTTGCCT GAGTGGCACG TCGCTCGCGT ACCTGCTCCT GGCGATTGCC
GACAGTCTGC CGCTGTTGGT ACTGGCGCTG TTCATCGATG GGGTCACCGG CGGCAATCTC
AGCATCGCCC AGGCGTCGAT CGCCGACAGC ACCACGCCGG ATCGCCGTGC ACGCGGTCTG
GGTCTGATCG GAGCGGCATT TGGATTGGGA CTGATGGTTG GTCCTGTGAT AGGCGGGGTG
CTGAGCCTGA CCAACCTGAG CGCCCCGGCG CTGGTCGCTT CGATGCTGGC GTTTGCGAAC
ACCCTGTTTG CGCTTGCCGC GCTCCCCGAA TCGCTACCGC CGGAACGCCG CCGATTAATC
CCTCTCGATA GCGCGAAGCC ATCGCACTGG AGCATGGTGC TGCGCGTTGC AAACCCACTG
GCGAACCTGA TTGTCCTGCT GCGAATTGTG ACGATCCGTC GCGTGTTGAT GGTCGTAGTG
TTGCTGAACC TCGCATTCTC AGGGCTGTAC AGTAACTTCC CGCTCTTTAC CGCCGCGCGC
TTTGGCTGGG GTATGTTCGA GAATGCGCTA TTCTTTGCGT TTGTGGGTAT CTGCGCAGTG
ACCACACAGG GATTGCTGCT CGGTCGCATG CAGCGCTGGC TGGGAGACGC GCGACTGGCG
CGTGTTGGAA TGATCGTGAT GGTATGCGCC CTGCTCGCAA CCGGTCTGGC GTCAGCGGCA
TGGATGCTCT ATCCATCAGT GGGATTGATC GCGTTTGGCA GCGGTCTGGC AATCCCGGCA
CTCACGAGCC TGCTCTCGCT CCAGGTATCG CCCGCCGACC AGGGGCGCCT GATGGGAGGA
ACGGCAGCAC TGCTCAACCT GACGATGATC GCCGGTCCAG TGGTGGCGGG GATCAGTTTT
GATCGGGCGG GAACGGCGGC GCCATATCTC ATCGGAGCGT TGCTGGGAAG TGCGGCGTTG
TTGATATTCG CCTCGCCAAC GATCATTCCT CGTCAGGAGG CAACGTCGTG A
 
Protein sequence
MILCQDTDKH LMKRNALAIL FLAVFVDLVG YGMIVPLLPF YVQRVAPGAT LVGILSGFYA 
MAQFLVGPML GSLSDRFGRR PVLIACLSGT SLAYLLLAIA DSLPLLVLAL FIDGVTGGNL
SIAQASIADS TTPDRRARGL GLIGAAFGLG LMVGPVIGGV LSLTNLSAPA LVASMLAFAN
TLFALAALPE SLPPERRRLI PLDSAKPSHW SMVLRVANPL ANLIVLLRIV TIRRVLMVVV
LLNLAFSGLY SNFPLFTAAR FGWGMFENAL FFAFVGICAV TTQGLLLGRM QRWLGDARLA
RVGMIVMVCA LLATGLASAA WMLYPSVGLI AFGSGLAIPA LTSLLSLQVS PADQGRLMGG
TAALLNLTMI AGPVVAGISF DRAGTAAPYL IGALLGSAAL LIFASPTIIP RQEATS