Gene Rxyl_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0872 
Symbol 
ID4117202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp912890 
End bp914515 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content68% 
IMG OID638035655 
Productextracellular solute-binding protein 
Protein accessionYP_643651 
Protein GI108803714 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00426332 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGAC GGAACGCGCG AGCCAGGGTA GGGCTAGCCG GGATCTCCCG CGGGGAGTTC 
CTCAAGCTGA GCGGCGCCGG CCTCGCCGGC GCCGCTCTTC TGGGTGCTGC CGGCTGCGGC
GGCGAGCAGG GCGGCGGCCA GCGGGGCGGC GGTGGCGGCG GGCGGACGGA GCTCATCGTG
GGTCTGGACC AGGAGCCCGC CATCCTGAAC GGGTACATCG TGGGCGGGGA TCTGGTCGCC
ACCTCCAACG TCACCCGGCC CGTCATGGAG AGCGTGCTGC AGATAATGCC GGACCTCTCG
TACGCGCCGA AGCTCGCCGA CGGGGAGCCG CGGGTGGCCA GCGAGGACCC GCTCACCATC
GAGTTCAGGC TGAAGGACGG GATCACCTGG TCCGACGGGG AGCCGCTCAC CGTCGAGGAC
TACGTGTTCA CCTACAACAC CGTCATGAAC GATCGGTGGC AGATCATCAC GCGCGAGATC
TGGGACAGCA TAGACCGGAT CGAGACCCCC GACGAGCTGA CCGCCAGGAT CATCTTCAAG
AGGCCCGACG CCCGCTGGCG GGACATCCTT GCCGCCGACG TGCTGCCCAA GCACGTCCTG
CAGGGGAAGA ACTTCAACAA ATACTTCAAC GACCGGATCG TGGGCAGCGG CCCCTACGTC
TTCGAGGAGT GGCGCAAGGG GCAGAGCCTC ACGGTCGTCG CCAACGAGAA CTACTGGGGC
GACCCCCCGG CGATAAAGAA GATCACCTTC CGCTTCATCC CCGACACCAA CTCGCTGAAG
GCCGCCCTGC GCTCCGGCGA GGTGCAGTTC ATCAACCCGC CGCCGGACAT CGGGCTGATC
GAGGAGCTGC GGGGCTACGA CGGGGTGACC GTCCAGACCA AGTTCGGCAC GGTCTGGGAG
CACCTGGCCT TCAACGTGGA GAAGGTGGAC AACCTCAACA TCCGGCGGGC CATCGCCTAC
GCGGTGAACC GCCGCCAGCT GATCCAGGAG ATCCTGCAGG GCGAGGCCCG GCCGCTGCAG
AGCGTGTTGG TGCCCGAGCA GGAGCCCTTC TACACCCCGG CCTGGGAGCG CTACTCCTTC
GACCCGGACC GGGCGCGCCG GCTCGTGGAG CAGGCGCGGG GGGAGGGGGC CTCCACGGAG
ATAGAGTACT CCACCACCTC CGGGAACGCG CTGCGGGAGA CGGCCCAGCA GGTCATCCAG
CAGCAGATGG AGCAGGTGGG GATAACGCTC CGGATAAACA ACTCCTCCGC CGAGACCTAC
TTCGGGGAGC GCACGCCCGA GGGCGACTTC GAGATGGGCG AGTGGGCCTG GAGCGCGACC
CCCGACCCCT CCATCACCAC GCTCTTCGGG GCGAACCAGG TGCCGCCGAA CGGGCAGAAC
TACTACCGCT ACCGCAACGA GGAGGTCACC CGGCTGCTCG AGCAGGCCGA CATCACCGTG
GACCAGCAGG AGCGGGCCCG GCTCACCCGC AGGGCCCAGG AGCTCATGGC AGAGGACGTG
CCGCTGGTCC CGCTCTACCA GCGGCCCGAG ATCTACGCCT ACGCCGACAA CCTCGAGGGG
CCGAGGGTCA ACCCGACGCT GGCCACCGCC TTCTGGAACG TCGGGGAGTG GCGCTTCACC
GGGTAG
 
Protein sequence
MGRRNARARV GLAGISRGEF LKLSGAGLAG AALLGAAGCG GEQGGGQRGG GGGGRTELIV 
GLDQEPAILN GYIVGGDLVA TSNVTRPVME SVLQIMPDLS YAPKLADGEP RVASEDPLTI
EFRLKDGITW SDGEPLTVED YVFTYNTVMN DRWQIITREI WDSIDRIETP DELTARIIFK
RPDARWRDIL AADVLPKHVL QGKNFNKYFN DRIVGSGPYV FEEWRKGQSL TVVANENYWG
DPPAIKKITF RFIPDTNSLK AALRSGEVQF INPPPDIGLI EELRGYDGVT VQTKFGTVWE
HLAFNVEKVD NLNIRRAIAY AVNRRQLIQE ILQGEARPLQ SVLVPEQEPF YTPAWERYSF
DPDRARRLVE QARGEGASTE IEYSTTSGNA LRETAQQVIQ QQMEQVGITL RINNSSAETY
FGERTPEGDF EMGEWAWSAT PDPSITTLFG ANQVPPNGQN YYRYRNEEVT RLLEQADITV
DQQERARLTR RAQELMAEDV PLVPLYQRPE IYAYADNLEG PRVNPTLATA FWNVGEWRFT
G