Gene Rxyl_0550 details

Gene Information

Locus tag: Rxyl_0550
Symbol:
ID: 4116166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 578974
End bp: 580068
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 68%
IMG OID: 638035338
Product: hypothetical protein
Protein accession: YP_643336
Protein GI: 108803399
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
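
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither assumption is stated in the record itself. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 578974
END_BP = 580068
GENE_LENGTH_BP = 1095
PROTEIN_LENGTH_AA = 364


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1095
print(expected_protein_length(GENE_LENGTH_BP))  # 364
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.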


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0452037
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGCCAT CTTCTTCTGG CGGGCTTGGC GGCTACCGGC TGGGGCGGCG GGAGTTCCTC 
GCGGCGGGGG CCCTTGCGGG GGCGGGGCTC CTGCTCGGGG GTTGCAGGAG GGCCGAGGAG
GCGGGCCAGC AGGGCGGGGG CGGGGGCGTG GGGAACTTCC CCGAGACGCC CGAGTACAAC
TTCGTGTTCG TGAACCACGT CACGACGAAC CCCTTCTTCA CGCCCACGCA GTACGGCATC
GAGGACGCCA GCGCGCTGCT GGGGACCCGC TACCAGTGGA CCGGGTCCGA GACCTCGGTG
GTGCGGGAGA TGGTTGACGC GATGAACACG GCCATCTCGG GGGGTGCGGA CGGGATAGCG
GTCTCCATCG TGGACCCGGA CGCGTTCAAC GACCCCATCC GGCGGGCGCT CGACCAGGGG
ATACCCGTGG TGGCGTACAA CGCCAACGGC AAGGGGCCGG GGACCAACCC GGCGCTGGCG
TACATCGGGC AGGACCTCTT TCTCTCCGGG GTCGAGATGG GCAAGCGGAT CGTGGAGCTC
GTCGAGGAGG GTCCGGTGGC GCTGTTCATC GCCACCAAGG GCCAGCTCAA CATCCAGCCG
CGCATCGACG GGGCCATACA GGCCATCGAG GACTCCGGGG CTCCCATAGA GTACGAGGAG
ATAGAGACCG GGGCCGAGCT GCCCGAGGAG CTCAACAGGA TCGACGCGTA CTACCAGGGG
CACAAGGACG TGCGGGGGAT GTTCGCCGTG GACGCGGGCA GCACGCAGGG GGTGGCGCAG
GTCATGAAGA AGTACAACCT GCACGAGCAG GGGGTGAGGG CCGGCGGCTA CGACCTCCTG
CCCAAGACGC TGGAGATCCT GCGGGAGGGG CACATAGACT TCACCATCGA CCAGCAGCCC
TACCTGCAGG GCTTCTACCC CGTGCTGCAG CTGTACCTGT ACAAGATCTC CGGGGGGCTC
ACCGGACCGG CCGAGACCAA CACCGGGCTC AAGTTCGTCA CCCAGGAGGA CGCCGGGCAG
TACCTGGAGA CCGAGTCCCG CTACGAGGGC GACTCCGAGC GGCCGCGGCT GCTCGAGGCG
CCGGCGGCTT CGTAG
 
Protein sequence
MEPSSSGGLG GYRLGRREFL AAGALAGAGL LLGGCRRAEE AGQQGGGGGV GNFPETPEYN 
FVFVNHVTTN PFFTPTQYGI EDASALLGTR YQWTGSETSV VREMVDAMNT AISGGADGIA
VSIVDPDAFN DPIRRALDQG IPVVAYNANG KGPGTNPALA YIGQDLFLSG VEMGKRIVEL
VEEGPVALFI ATKGQLNIQP RIDGAIQAIE DSGAPIEYEE IETGAELPEE LNRIDAYYQG
HKDVRGMFAV DAGSTQGVAQ VMKKYNLHEQ GVRAGGYDLL PKTLEILREG HIDFTIDQQP
YLQGFYPVLQ LYLYKISGGL TGPAETNTGL KFVTQEDAGQ YLETESRYEG DSERPRLLEA
PAAS
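
The gene and protein sequences above can be compared with a small script. This is a minimal sketch, not the method used to produce the record: it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence, copied from above.
print(translate("ATGGAGCCAT CTTCTTCTGG CGGGCTTGGC"))  # MEPSSSGGLG
print(translate("CCGGCGGCTT CGTAG"))                  # PAAS
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field listed in the record.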