Gene Rxyl_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2603 
Symbol 
ID4114684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2628556 
End bp2629572 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID638037376 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_645332 
Protein GI108805395 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.912871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAG GCCTCTTCTG GCGGGGACGC AGGATCTCGC GCCGCGAGTT TCTCGGCGTC 
GCGGCCGCGG GCACCGGGGC GGTGTTCCTC GGGGGCCTCA CCACCTCCTG CGGCGGGGGC
GGAGGAGAAG GCGAGGGCTA CAGGCTCGCT CTCATAGTCG GGGTCACCGG CGACGAGTTC
TACACCACCA TGGAGTGCGG GGCGCGGGCG GCGGCCCGAA AGCTCGGCGC CAGGCTCAAC
GTGCAGGGGC CCGAGGAGTT CTCGCCCGCG GCGCAGACCC CCATCCTGAA CGCCGTGGTG
CAGTCCAACC CCGACGCCAT CCTCATCGCC CCCACAGACC GGACCGCGAT GGTGGGTCCC
ATCCAGAGCG CCGTCAACCA GGACATCCCC GTGGTGCTGG TGGACACCAC CATCGAGAAG
GAGGAGATAG CGCTGGCCCG CATCTCCTCG GACAACGTCG AGGGGGGCAG GATGGCCGGG
GAGGCCCTGG CCGAGCAGAT AGGGGGCAAG GGCAAGGTGC TCCTCATCAG CGTCAAGCCG
GGCATCTCCA CCACCGACCA GCGCAAGCAG GGCTTCGAGG AGGCGATAAA GCAGTACCCG
GACATCGAGT ACCTGGGGAC CGAGTACTGC AACGACGACC CCACCCAGGC GGCCTCCATC
ACCACCTCCA CCCTGCAGGC CCACCCGGAT TTGGCCGGCA TCTTCGGCGC CAACGTCTTC
TCCGGACAGG GAGCCGGGAC CGGGGTCCGG CAGGCGGGCA AGCGGGACCA GGTGAGCGTG
GTGGCCTTCG ACGCCTCCCC CACCCAGGTG GAGGATCTGC GCCGGGGCAA CCTGGACGTG
CTCATCGCCC AGCACCCCAA CGACATCGGG AGAAGGGGCG TCCAGATCGC CGTGAGGTAC
CTGGAGAGCG GCGAGGAGCC GGAGAACAAG CAGATCACCA CCGGCTTCAC CACCGTCACC
CGCGACAACC TGGACGCCCC CGAGGTCGAG CGTTACCTCT ACCGGGCCCA GTGCTAG
 
Protein sequence
MNEGLFWRGR RISRREFLGV AAAGTGAVFL GGLTTSCGGG GGEGEGYRLA LIVGVTGDEF 
YTTMECGARA AARKLGARLN VQGPEEFSPA AQTPILNAVV QSNPDAILIA PTDRTAMVGP
IQSAVNQDIP VVLVDTTIEK EEIALARISS DNVEGGRMAG EALAEQIGGK GKVLLISVKP
GISTTDQRKQ GFEEAIKQYP DIEYLGTEYC NDDPTQAASI TTSTLQAHPD LAGIFGANVF
SGQGAGTGVR QAGKRDQVSV VAFDASPTQV EDLRRGNLDV LIAQHPNDIG RRGVQIAVRY
LESGEEPENK QITTGFTTVT RDNLDAPEVE RYLYRAQC