Gene Rxyl_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3001 
Symbol 
ID4115800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3008682 
End bp3009713 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content70% 
IMG OID638037771 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_645723 
Protein GI108805786 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGCTGGA GAAGGATCCT GGTTTTGCTG GTGGCGGCCG TCGCCGCGCT CGCCCTCGCC 
GCGTGCGCCG AGGTCAGGGA GCAGGGAGGG GGCCAGCAGG GCGGCGGAGA GGGCCGGCAG
GGCCCCATAG AGCTCGCCGT CGTGCCCAAG GCCGTGGGCT TCGACTTCTG GGAGACGGTG
CGTCAGGGGG CGGTGTGCGC CGCCAAGAGG GCCGAGGGCG AGGTCGACGT CCAGTGGGAC
GGGGTCGCCC AGGAGACCGA CGTTACCGGG CAGGTCAACC TGCTGCAGAA CTTCATCACC
CAGGGGGTGG ACGGGCTCGT CTACGCCGCC ACCGACGCCA AGGTGCTCCA CGACGTCACG
CAGCAGGCGC TCGACCAGGG CATAACCGTG GTCAACATAG ACTCCGGCAC CGACCCGCAG
CCCGAGAACG TGCCGGTCTT CGCCACGGAC AACGTGGCGG CCGCCGAGCG GGCGACCGAG
TACCTGGTGG AGCAGCTCGG CGAGGACGGC GGGAAGGTGG CGTTCATCCC CTTCCAGCCC
GGCACGGCGA CGAACGACAC CCGCACGGAG GGCTTCAAGA ACGTCCTCAA GGAGAACCCG
CAGGTAAAGC TCGTCGCCGA GCAGTCCAGC GAGAGCAACT ACAACCGGGC GCTGCAGGTC
ACCGAGGACA TCCTCACCGC CCACCCGGAT CTGGACGCCA TCTACGCGGC CAACGAGCCC
GGCGTGCTGG GCGCCGCCGA GGCGGTGAGG AGCGCCGGGA AGGCCGGGGA GATCATCATC
GTCGGCTGGG ACACCGCCCC CGACGAGCTC AAGGCCGTGC GCGAGGGCGT GGTGAGCGCG
CTCATCGCCC AGAACCCCTT CAGGATGGGC TACGACGGGG TGAACGCGGC GGTGAAGATG
ATCCGTACCG GCGAGCAGGT CGAGGGCGGC GACACGGGGG CGATACTGGT CACCCGGGAG
AACATAGACG ACCCGGAGGT CCAGCGGGTC CTCGACCCGA GCTGCGAGAA CCCGCCCGTC
GAAGGGCAGT AG
 
Protein sequence
MCWRRILVLL VAAVAALALA ACAEVREQGG GQQGGGEGRQ GPIELAVVPK AVGFDFWETV 
RQGAVCAAKR AEGEVDVQWD GVAQETDVTG QVNLLQNFIT QGVDGLVYAA TDAKVLHDVT
QQALDQGITV VNIDSGTDPQ PENVPVFATD NVAAAERATE YLVEQLGEDG GKVAFIPFQP
GTATNDTRTE GFKNVLKENP QVKLVAEQSS ESNYNRALQV TEDILTAHPD LDAIYAANEP
GVLGAAEAVR SAGKAGEIII VGWDTAPDEL KAVREGVVSA LIAQNPFRMG YDGVNAAVKM
IRTGEQVEGG DTGAILVTRE NIDDPEVQRV LDPSCENPPV EGQ