Gene RoseRS_0263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0263 
Symbol 
ID5207198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp331594 
End bp332913 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content62% 
IMG OID640593892 
Productextracellular solute-binding protein 
Protein accessionYP_001274648 
Protein GI148654443 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.962251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00117926 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACACC GCCTCTCGCT GCGCTCCGCG CTCTGTCTGC TGACGCTGTC TCCATTGCTT 
GCCGCATGTG GCTGGTTTGC GCCGCCAACG CCGACGCCGA CGCCCGAACC GCGGGTGCTG
CGGGTATACA CGGTGCGTGA CCCGACGATC GAAGGGGTCG TTGAGATTAT CAGCCAGGGA
TTCCGTACCC GACATCCCGA TGTAACCATC GAATTCATCT ATGGCGATGG GAGTTATAGC
GAATTGCAGG GCAGCGCAGC CACCGGCAAC ATCCCCGATG TCGTCTGGGC GCCGGACGTG
ATAACGCCAG CGCTCATCGA GGCGGAACTG CTGCTCGACC TGGAAGAGTT CGCCAGGGGC
GACGGAAGCG TCAATCTGGA AGACGTGCAT CCAGTAGCGC TTGAACCTGG ACGGTCGCGC
GTTCGTCCCG GTCTCTACCT GATTCCCGCA TCCCTCGAAA CCATCCAGAT GTACTACAAT
AGTTCCTTGT GGGAACGCTC CGGCGCGCCG TTGCCGCGCG ACGATTGGAC GTGGGACGAT
CTGATTGCGG CATGTAAACG GGTGCAAGAA TCCACGCCGG GCGTCGATTG TCTGAGTTTC
ACGAATAGCG GATTGTTCGA CCACACCGCA TGGGTGTACT GGCTGCCATG GGTGCGCGGC
GCTGGCGGCG ATGCCCTCAG CGCCGACGGT GCGCAATCAA CGCTGAGTGC GCCGCAGTCG
CTCGAAGGGT TGCAGGGGTA CCTCGACCTC TGGATCCGGC ACAAGATCGC GGCACAACCG
GGCGCCAGCC AGGACGATTG TTTTGTTGCG CAAACGTGTG CGGCATTCTT CTCATTTGCC
GGCGCCGCAC GGATCTACCG TGAGCAAATC GGCGACCGCT TCGCGTGGGA CGTACAGATT
GTGCCAGCGC ATCGGGCAGG ACGTTTCACC GGTATCGGCT CATACGGCTT TGCCGTGACC
CGCGCTTCTC GCGAGCCGCA ACTGGCATGG GACTTTGTGA AATATATCAT CACGCCGGAA
GCGCAGCGCG CCATCGCTGC CGCCTATCTG GGCACGCCGG CGCTCCTGTC GCTGAGCAAC
GATCCGGCGG TGGTGCAGTT GCCGCCGCCG CTGGCGAATA TGCGCGCCTT CGTTGTCGGG
CGCGAGGCAG GCATTACGCC GCCGCGCTAC CCGACCGCCT GCGGCAGCGT CTACAACGGT
CCGGTTTCCG CTGCTATCGC CGATGCACTC AATGCTGCAC TACGCGAAAC AGTGTCGGTG
GAGGGCGCAT TTACTATTGC TGATCGCAAG ATACAGACCT GTCTGGACGC GAATCGGTAG
 
Protein sequence
MKHRLSLRSA LCLLTLSPLL AACGWFAPPT PTPTPEPRVL RVYTVRDPTI EGVVEIISQG 
FRTRHPDVTI EFIYGDGSYS ELQGSAATGN IPDVVWAPDV ITPALIEAEL LLDLEEFARG
DGSVNLEDVH PVALEPGRSR VRPGLYLIPA SLETIQMYYN SSLWERSGAP LPRDDWTWDD
LIAACKRVQE STPGVDCLSF TNSGLFDHTA WVYWLPWVRG AGGDALSADG AQSTLSAPQS
LEGLQGYLDL WIRHKIAAQP GASQDDCFVA QTCAAFFSFA GAARIYREQI GDRFAWDVQI
VPAHRAGRFT GIGSYGFAVT RASREPQLAW DFVKYIITPE AQRAIAAAYL GTPALLSLSN
DPAVVQLPPP LANMRAFVVG REAGITPPRY PTACGSVYNG PVSAAIADAL NAALRETVSV
EGAFTIADRK IQTCLDANR