Gene Rxyl_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_1166 
Symbol 
ID4116850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp1191965 
End bp1193683 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content72% 
IMG OID638035956 
Producttype II secretion system protein E 
Protein accessionYP_643944 
Protein GI108804007 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAG GATCCGGGAC AACGGGAGGC AGGGAGAGAA ACCGCAGCGT CTGGAGCCTC 
CTGCTCTCGG AGGGGAGCCT CACCGAGGAG CAGCTGCACC GCGCCGTGGA GGCCCAGAAG
CACGACCCCC GGGATCTCGG CCAGATCCTC GTCTCCCTGG GGTACGTCTC GGCCGAGGAG
CTGGCGCGGG CCCGGGCGCG GCGGCTCGGG CTCGGCTACC TCGAGCCCTC CGAGCGGGAC
GTGGACCCGG CCGCGCTCGG CCTCGTCCCG GAGAGGGTGC TGCGCCGCCA CAGGGCGCTC
CCCCTGCGGC TGGAGGAGGG GAGGCTCGTG GCCGCCCTCG CGGACCCCAC CGACCTGCAG
GCGCTCGACG ACCTGCGGAT GCTCTCCGGC TACCCCGTCA CCCCGGTGGT GGCCACCGAG
GAGGCCATCC GCAGGCTGCA GATAAAGCTC TTCGCCGTGG ACGAGCGGGT GACCGGCATC
CTGCGGGAGG CGGAGCTTCG TGAGGCGCGG GAAGAGGACG ACGACCTCGA CCTCGGGGCC
GGGGCAGGGG CCGAGGAGCG GCCCGTCATA CGGCTGGTGA GCTCCATCCT GCAGCAGGCC
ATCTCCGACG GGGCCTCGGA CATCCACCTC GAGCCGCGCC CCGGCAGGCT CGCCGTGAGG
GTCCGGGTGG ACGGACTGCT CCGGGAGGTC ATGTCCATCC CCCACAGGCT GCAGAGCGGG
GTGATCTCCC GGCTCAAGCT CGTCTCCGGG CTGGACATCG CCGAGCGGCG GCTGCCGCAG
GACGGGCGGT TCTCGGTGAG GATCGGGCAG CAGAAGGTGG ACTTCCGGGT GGCCTCGCTG
CCCACCGTGC ACGGGGAGAA GGTCGTGCTG CGGCTGCTGG ACAACTCGCA CGCCGCGGCC
CGGCTGCCGG AGCTCGGGCT CTCCCCGGAG CTCCACCGCC GCTACGAGAG CGTCTTCCGC
AGGCCCTACG GGGCGATCCT CGTCACCGGG CCCACCGGCA GCGGCAAGTC CACCACCCTG
TACGCCACGC TCGCCGAGCT CAACGACCCC CGGAAGAACA TCATCACCGT GGAGGACCCG
GTGGAGTACA GGATCGAGGG GATAAACCAG ATCCAGGTCA ACCCCCGCAT CGGCCTGAGC
TTCGCCTCCG CGCTCAGGAG CATCCTGCGC AGCGACCCGG ACGTCGTGAT GATCGGGGAG
ATAAGAGACC ACGAGACCGC CAAGATCGCG GTGGAGTCCG CCCTCACCGG GCATCTGGTC
CTGGCCACGC TCCACACCAG CGACGCCCCC GGGGCGCTCA CCCGCCTCAC CGACATGGGC
GTGGAGCCCT TCCTGACCGC CTCGGCGGTG GACTGCGTGG TCGCCCAGCG GCTCGCCCGC
CGGCTGTGCG AGCGGTGCCG GAGGCCGGCG GAGGTGGAGA GGGGCCTGCT CGAGGGCATC
GGCTTCCCCT TCCAGCTGAT CTCCGAGGAG GAGGCGAGCT TTCACCGGGC GGTGGGCTGC
GAGTGGTGCG GCGGGACCGG CTACCGGGGC AGGATCGGGG TCTACGAGCT GATGATGGTG
GACGAGGCGG TGGGGGAGCT CGTCCTGCGG CGCGCCTCCA CCGCGGAGAT CGCCCGGGCG
GCCGAGGCGG GCGGGATGGT GCGGCTGCGG GAGGACGCGC TCCTGAAGGC GGCCCGCGGC
ACGACGACGA TCGAGGAAGC GTTGAGGACG GTGGTATGA
 
Protein sequence
MAAGSGTTGG RERNRSVWSL LLSEGSLTEE QLHRAVEAQK HDPRDLGQIL VSLGYVSAEE 
LARARARRLG LGYLEPSERD VDPAALGLVP ERVLRRHRAL PLRLEEGRLV AALADPTDLQ
ALDDLRMLSG YPVTPVVATE EAIRRLQIKL FAVDERVTGI LREAELREAR EEDDDLDLGA
GAGAEERPVI RLVSSILQQA ISDGASDIHL EPRPGRLAVR VRVDGLLREV MSIPHRLQSG
VISRLKLVSG LDIAERRLPQ DGRFSVRIGQ QKVDFRVASL PTVHGEKVVL RLLDNSHAAA
RLPELGLSPE LHRRYESVFR RPYGAILVTG PTGSGKSTTL YATLAELNDP RKNIITVEDP
VEYRIEGINQ IQVNPRIGLS FASALRSILR SDPDVVMIGE IRDHETAKIA VESALTGHLV
LATLHTSDAP GALTRLTDMG VEPFLTASAV DCVVAQRLAR RLCERCRRPA EVERGLLEGI
GFPFQLISEE EASFHRAVGC EWCGGTGYRG RIGVYELMMV DEAVGELVLR RASTAEIARA
AEAGGMVRLR EDALLKAARG TTTIEEALRT VV