Gene Rxyl_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0213 
Symbol 
ID4117840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp213621 
End bp214961 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID638035004 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_643003 
Protein GI108803066 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGGCGGA GAGAGGAGGG AGGTTCCCTG GTGCGCGGGA GAGCGCTCTT CAGCAGGCGG 
GAGTTCCTCA GGCTCGGCGG GGCCGGGGTG GCCGGGGCGG CGCTGCTCGG GGTCGCGGGC
TGCGGCGGCG GGGGAGAGCA GGGGGGGCCG GTGCAGCTGG TCTTCTCCCA CGGGCCCGAG
CAGTCCGGCG TCCTCAGAGA GCAGCTGGAC GCCTTCAACC GGCGGCACGA GGGCGAGATC
CGGGTCGAGT GGCGGGAGAT GCCGGCCCAG ACCGAGCAGT ACTTCGACCG CCTCAGAACC
CAGTTCCAGG CCGGCGGGGG GGATATCGAC ACCATAAGCG GCGACGTGAT CTGGCCGGCC
CAGTTCGCGG CGAACGGCTG GATCGTGGAC CTCTCCGACC GCTTCCCCGA GTCCGAGAGA
GAGAAGTTCC TCGACGGCCC CATAAACTCC AACGTCTACG AAGGGGCGAT CTACGGGGTC
CCCTGGTTCA CCGATGCGGG CATGCTCTAC TACCGCAAAG ACCTCCTGCA GAAGAGCGGG
TACTCGGAGC CGCCCAGAAC CTGGGACGAG CTCAAGGAGA TGGCACTGCG CGTCAAGCAG
GACTCCGGGA CCAAGTTCGG TTTCGTCTTC CAGGGGGCGA ACTACGAGGG CGGGGTGGTG
AACGGTCTCG AGTACATCTG GACGCACGGG GGGGATGTGC TGGACCCGGA GGACCCCACG
AAGGTCATCA TAGACAGCCC CGAGTCGGTG GCGGGGCTGA AGACCGAGCG GAGCATGGTG
GAGGAAGGGG TGGCGCCAGA GGCGGTGGTC AACTACGCCG AGATGGAGTC GCACACCGCC
TTTCTGAACG GGGATGCCGT CTTCATGCGC AACTGGCCCT ACGTGTTCGG GCTCTTCGGG
CAGTTCCCGG TGAAGCCGGA GCAGGTGGAC GTGGCCCCGC TGCCGGTGGA CCGGGAGGGG
CGGCAGTCCA CGAGCAGCCT CGGCGGCTGG AACCTGTTCA TCAACGCCGC CTCGGAGGAC
GAGGCGGACG CCGCCTGGAC CCTCATAGAG TACCTCGCCG CCCCCGAGCA GCAGAAGCAG
CGGGCGCTGG AGGGAGGGTA CCTTCCCACG CTGGAGGAGC TCTACGAGGA CCAGGAGATC
CTGGACAAGG TGCCGGTCAT AGCGCTCGGC AAGGAGGCCA TCAGGAACAC CCGCCCGCGC
CCGGTCTCGC CGTACTACTC GGACATGTCG CTCAGGATGG CCGAGCAGTT CAACGCCTCC
CTCAAGGGCG AGGTCTCCCC CGAGGAGGCC GTCGGCACCC TGCGGGAGGA GCTGCAGAAC
ATCGTGGAGC AGGGCAGCTA G
 
Protein sequence
MRRREEGGSL VRGRALFSRR EFLRLGGAGV AGAALLGVAG CGGGGEQGGP VQLVFSHGPE 
QSGVLREQLD AFNRRHEGEI RVEWREMPAQ TEQYFDRLRT QFQAGGGDID TISGDVIWPA
QFAANGWIVD LSDRFPESER EKFLDGPINS NVYEGAIYGV PWFTDAGMLY YRKDLLQKSG
YSEPPRTWDE LKEMALRVKQ DSGTKFGFVF QGANYEGGVV NGLEYIWTHG GDVLDPEDPT
KVIIDSPESV AGLKTERSMV EEGVAPEAVV NYAEMESHTA FLNGDAVFMR NWPYVFGLFG
QFPVKPEQVD VAPLPVDREG RQSTSSLGGW NLFINAASED EADAAWTLIE YLAAPEQQKQ
RALEGGYLPT LEELYEDQEI LDKVPVIALG KEAIRNTRPR PVSPYYSDMS LRMAEQFNAS
LKGEVSPEEA VGTLREELQN IVEQGS