Gene Rcas_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4358 
Symbol 
ID5541871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5612649 
End bp5613929 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content63% 
IMG OID640896464 
Productmajor facilitator transporter 
Protein accessionYP_001434400 
Protein GI156744271 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.569485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGTCCC CATATGCCAT TCCGCCATCG CGTCGCCGTC TTCGTATGCC GCATGCGCTG 
CGCGCCTTGC GCCATCGCAA CTACCGGCTA TTGTTCTTCG GGCAACTCAT CGCGCATATC
GGGTTTTGGA TGCAGGCGAC CGCGCAAGGC TGGCTGGTAT TGCGCCTGAC GGACGCGCCA
TTCTGGCTTG GCGCCACCGC CGCTGCGCAG TCGTTGCCGG TACTGATCCT GTCTGCGCCT
GCCGGCGCTC TGGCGGATCG CATCCCAAAA CGCACCCTGC TGTTGATGAC CCAGGGAACG
GCAATGGCGA TGGCGCTGTT GCTGGCGTTG CTGATCTTCA GTGATGTCGT GCAGGTCTGG
CATGTCCTCA TCGCTGCCCT GATGGTCGGC ATTGCGTCTG CGTTCGAGAA CCCGGCGCGC
CAGGCGTTTA CCATCGAACT CGTTGGGCGA GAGGACCTGA TGAATGCCAT TGCGCTCGAC
TCGACCATCA TGAACGGCGC GCGCATTATT GGACCGGCGG CTGCCGGAGC GCTGGTCGCC
GTCACCGGCG AAGGTCCGGC GTTTCTGTTC AATGGGCTGA GCGTTCTGGC AGTCATCGGC
GGGCTGCTCA TGATGCGACT GCGCCCCTTC GTCGCGCCGC TGCGCCAGAG CCACTGGCAG
CAAATGCGCG AAGGGTTTGC CTATATACGC CGCGATGCGC GGGTGCGACT GCTGCTGCTT
CAGATCGCCG CCCACTGCGT CTTTGGACTG GCATACTTCC CACTCATGCC CTATTTTGCG
CGCAATGTGC TCGGCGCCGA TGCGCAGGGC TTCGGCGTGC TGGCGGCAAC CAACGCCGCC
GGGGCGCTGG CTGCCGCACT GATGATCACC CTCGTCGGCG ATCGTCTGCC GCGGGTTGGC
GTGCGTTCAG TTGCCTTGCT CAGTTATATG CTGTTGCTTG GTGCGTTTAC CCTGACGCGA
TCGTTTGTAC TGGCGATGGC GCTGCTGGCG GCGATTGGAT GGACGGGGAT TATGGTGCTG
ACGTTGACGA ATACCCTGCT GCAAATGGCC GTGCCGGACG ACATGCGTGG ACGGGTGATG
GGTGTCTATA TGCTCGTCGT GATGGGGGTC AGCCAGGTGA GCGGGCTGTT CCTCAGCAGC
GTCGCCGATG TTCTCGGCGA TGTGCCATTG GTCGTCGGAT GTTGGGCGCT GGTCGGCTGG
TGCATTCAGG TCTATCTCTT CACGCTGTGG CGACGCGCGC CGGATAATGC TGCGCAGGTC
GCGTCGTTGC CGCGCGTGTG A
 
Protein sequence
MSSPYAIPPS RRRLRMPHAL RALRHRNYRL LFFGQLIAHI GFWMQATAQG WLVLRLTDAP 
FWLGATAAAQ SLPVLILSAP AGALADRIPK RTLLLMTQGT AMAMALLLAL LIFSDVVQVW
HVLIAALMVG IASAFENPAR QAFTIELVGR EDLMNAIALD STIMNGARII GPAAAGALVA
VTGEGPAFLF NGLSVLAVIG GLLMMRLRPF VAPLRQSHWQ QMREGFAYIR RDARVRLLLL
QIAAHCVFGL AYFPLMPYFA RNVLGADAQG FGVLAATNAA GALAAALMIT LVGDRLPRVG
VRSVALLSYM LLLGAFTLTR SFVLAMALLA AIGWTGIMVL TLTNTLLQMA VPDDMRGRVM
GVYMLVVMGV SQVSGLFLSS VADVLGDVPL VVGCWALVGW CIQVYLFTLW RRAPDNAAQV
ASLPRV