Gene Rcas_0274 details


Gene Information

Locus tag: Rcas_0274
Symbol:
ID: 5537736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 340472
End bp: 341791
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 62%
IMG OID: 640892438
Product: extracellular solute-binding protein
Protein accession: YP_001430425
Protein GI: 156740296
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
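
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(340472, 341791)
print(length, protein_length(length))  # prints: 1320 439
```

Both results match the Gene Length (1320 bp) and Protein Length (439 aa) fields of this record.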


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.285114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTGGC CTGTGTTGGT GCGCCGGGGA TGGCTGGTCG TGCTGTCCTC CCTGCTCCTG 
GTGTCGTGCG GGTTGCTCGA ACCGCCGCCG CCGACTCCGA CGCCTGAGCC GCGCGTGTTG
CGCGTCTACA CTGTGCGCGA CCCCGCTATC GAGGGGGTGG TTGAGATTGT CAGCCAGGAG
TTTCGCCGTC GTCATCCCGA CGTGCAGATT GAACTCATCG ACGGCAGCGG CGATTATAGC
GAATTGCGAG GCAGCGCCGC AGCCGGCAAC GTTCCCGATG TCGTCTGGCT GATCGATATG
CTGACCGAGT CGCTGCTCGA GTCTGATCTA ATCATCGATC TGGAAGAATT CGCCAGAGGC
GACGACAGCG TCAACCTGGC AGATGTGCAT CCGATGGCGC TGGAACTCGG ACGATCCCGC
AAGCGCCCTG GTCTGTTCAT GATTCCGGTA TCGCTCGAAA CCATTCAGAT GTATTACAAC
CGTTCGCTCT GGGAACAATC CGGCGCGCCG CTGCCGCGCG ATGACTGGAC ATGGGATGAT
CTGATCGCAG CGTGCAAACG CCTTCAGGGG GCGGCGCCAG GGGTTGATTG CCTGAGTTTC
ACGAATGCCA GCCTAAACGG CTACGCCTGG TGGGTTTACT GGCTGCCGTG GGTGCGCGGC
GCTGGCGGCG ATGCGCTCAG CGCTGACGGA ACGCAATCAA CGTTGAGTTC GCCGCAGTCG
CTCACAGGGT TGCAGGCATA TGTCGATCTC TGGCTCACGC ACAAAATCGC AGCCCCACCC
GCTTCTGGCG GACGCGACTG CTTCGTGGAT CAGACGTGCG CTGCATTCTT TTCGTTTGCC
GGCGTTGCGC AGCGGTACCG CGATCAGATC GGCGACCGCT TCGCCTGGGA TGTGCAATTG
GTTCCGAGCC ATCCGGCAGG ACGCTTCACC GGCATCGGCA CGTATGGCTT CGCCGTAACG
CGCGCCTCGC GCGATCCGCA ACTGGCATGG GATTTTGTGA AAATCTTTAT TGCTCCAGAA
ACGCAGCGTG CGCTGACGGC TGCACATCTG GGCACGCCGG TGCTCCTGTC GTTGAGTAAC
GATCCGACAA TGATGCAACT GCCCGCGCCA CCGGCGAACA TGCGCGCGTT TGTGATCGGG
CGCGAGGCAG GTATTGCACC GCCGCGCTAC CCGACGGCAT GCGGCAGCGT CTACACCGGT
CCGGTGGCGT CGGCTCTCGA TGACGCGCTC AACGCCGCAG TGCGCGGGTT GGCGTCCGTC
GAGGGGGCGT TTGCGGTTGC AGACCGCAAG ATACAGACAT GCCTGGATGC GAATCGGTAG
 
Protein sequence
MQWPVLVRRG WLVVLSSLLL VSCGLLEPPP PTPTPEPRVL RVYTVRDPAI EGVVEIVSQE 
FRRRHPDVQI ELIDGSGDYS ELRGSAAAGN VPDVVWLIDM LTESLLESDL IIDLEEFARG
DDSVNLADVH PMALELGRSR KRPGLFMIPV SLETIQMYYN RSLWEQSGAP LPRDDWTWDD
LIAACKRLQG AAPGVDCLSF TNASLNGYAW WVYWLPWVRG AGGDALSADG TQSTLSSPQS
LTGLQAYVDL WLTHKIAAPP ASGGRDCFVD QTCAAFFSFA GVAQRYRDQI GDRFAWDVQL
VPSHPAGRFT GIGTYGFAVT RASRDPQLAW DFVKIFIAPE TQRALTAAHL GTPVLLSLSN
DPTMMQLPAP PANMRAFVIG REAGIAPPRY PTACGSVYTG PVASALDDAL NAAVRGLASV
EGAFAVADRK IQTCLDANR
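
The sequences above are laid out in blocks of ten characters, so the whitespace has to be stripped before they can be analysed. The following is a minimal sketch of how one might compute GC content and translate the gene sequence. It assumes that translation table 11 assigns the same amino acid to every codon as the standard code (the two differ in which codons are permitted as starts); the helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first and last few codons of the record are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record above.
first_row = "ATGCAGTGGC CTGTGTTGGT GCGCCGGGGA TGGCTGGTCG TGCTGTCCTC CCTGCTCCTG"
print(translate(first_row))    # prints: MQWPVLVRRGWLVVLSSLLL
print(translate("AATCGGTAG"))  # last three codons; prints: NR
```

The translated first row agrees with the first twenty residues of the protein sequence, and the last three codons give the final two residues followed by the TAG stop. Passing the complete gene sequence to `gc_fraction` should give a value comparable to the GC content field of the record.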