Gene Rcas_3626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3626 
Symbol 
ID5541128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4735169 
End bp4736410 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content63% 
IMG OID640895746 
Productmajor facilitator transporter 
Protein accessionYP_001433693 
Protein GI156743564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.348332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCA CACCGTCGCA GCGTAGCGCG TTTGCCGCAA GGACATTCGC AGCGTTACGG 
CATCGTAATT ATCGCCTCTG GTTCTTCGGG CAGATGGTAT CGCTCTTCGG CTCCTGGATG
CAAACCACTG CTCAGGGATT TCTGGTCTTT CAACTGACCG GTTCGCCCGC GTACCTGGGG
TATGTCGGGT TCGCCGCCGG CATTCCTGCG TGGGCGCTGA CTCTCTATGG CGGCGTGGTC
GCCGATCGTA TGCCGCGCCG AACTCTGCTG ATCATCACGC AGACGGCGCA GATGGCGCTG
GCATTCGCGC TGGCGGCGCT CGTGTTCAGC GGTATCGTCC AACCCTGGCA TATTGTCGCA
CTCTCGTTCC TGCTAGGGAT CGCCAATGCG TTCGACGCTC CGGCGCGCCT GGCATTCGTG
CGTGAACTGG TGGACAAGGA AGACCTGACG AATGGCATTG CGCTCAACGC GACGATGTTC
AACCTGGCGA CGACGACCGG ACCGGCGATG GCGGGGGTGA CCTACACTCT GGTGGGACCG
GCGTGGTGTT TCATGCTGAA CGGCATCTCG TTCCTGGCGG TCATTGGCGC GCTCTGGCGC
ATGCGGATGG CGCCGCAGCC GGTTGCGCCA CGCAGCGCCT CGGCGTGGCG CGACCTGCGC
GAGGGGTTGA GTTACATCCT GCACGAACCG GTGGTGCGCA CGCTGATTGC GCTGGTGGGG
GCGACGAGTT GTTTTGGCAT CTCGTTTGCG ACCCTCTTCC CGGCATGGGC GGTGCGCATT
CTGGGGGGCG ACGCCGCCAC AACCGGTCTC TTGCAATCGG CGCGCGGTCT GGGAGCGCTG
CTGGGAGCGT TGCTGATTGC GTCACTGGGG CGCTTTCAGT TCAAAGGGCG TCTATTGACA
GTCGGCACAT TTGCGTTCCC AACACTGCTC ATTGTGCTGA CCTTCACGAC CTGGCTGCCG
CTGACCCTGG TGCTCCTGAC GGCTTCGGGG CTGGCGGTGA TCCTGATCAT GAACCTGGCA
AATGCGCTGG TGCAGACGCT GACACCCGAT GCGCTGCGGG GTCGGGTGAT GGCGGTCTAC
AGCATGGTCT TTTTCGGAAT GATGCCAATC GGTGCGCTCT GGATCGGGGT GATCGCCGAG
CGAGCCGGTG AACCGACGGC AGTGATCAGC GGGGCGCTGG TGGTCCTGGG AGTCGCAGCG
CTCATACGTT TGGCTGTGCC GCAGATACGG AAGTTGACGT GA
 
Protein sequence
MTITPSQRSA FAARTFAALR HRNYRLWFFG QMVSLFGSWM QTTAQGFLVF QLTGSPAYLG 
YVGFAAGIPA WALTLYGGVV ADRMPRRTLL IITQTAQMAL AFALAALVFS GIVQPWHIVA
LSFLLGIANA FDAPARLAFV RELVDKEDLT NGIALNATMF NLATTTGPAM AGVTYTLVGP
AWCFMLNGIS FLAVIGALWR MRMAPQPVAP RSASAWRDLR EGLSYILHEP VVRTLIALVG
ATSCFGISFA TLFPAWAVRI LGGDAATTGL LQSARGLGAL LGALLIASLG RFQFKGRLLT
VGTFAFPTLL IVLTFTTWLP LTLVLLTASG LAVILIMNLA NALVQTLTPD ALRGRVMAVY
SMVFFGMMPI GALWIGVIAE RAGEPTAVIS GALVVLGVAA LIRLAVPQIR KLT