Gene Rxyl_2686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_2686 
Symbol 
ID4115128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2701301 
End bp2702773 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content67% 
IMG OID638037461 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_645415 
Protein GI108805478 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGCCGGA CGCTGAGCAG AAAGGAGTTC CTGCAGGTCG CCGGGGCCGG GGTTGCCGGG 
GCGGCCCTTC TGGGGGCCGG CGCCGGCTGT GGCGGCAGCA GCTATCTTCC GAGCGGGGGT
TCCAGGATGA ACGTTGTGGT GGTGATACTG GACAGCCTGC GGCGGGACCA CGTGGGCGCC
TACGGCAACG ACTGGATCCA GACCCCCACC CTCGACGCCC TCGCCAAGGA CAGCCTCCGC
TTCACCCGCC CCTACCCGGA GTCCATCCCC ACCATCCCGG CCCGCCGGGC CGTCTACACC
GGCAAGAGGA CCTGGCCGTT CCGCAACTGG GTGCCCCAGA AGGGGGAGAC CTTCTTCCCG
GCCGGCTGGC AGCGCATCCC CGAGGACCAG TGGAGCGTCG CCGAGATCCT CCTGGACAAC
GAGTACGACA CCGTGCTCAT CACCGACACC CAGCACCAGT TCAAGCCCTC CATGAACTTC
CACCGGGGCT TCAACGTCTT CGACTTCATC CGGGGGCAGG AGAGGGACCG CTACCGCCCC
AAGCAGACCG CCCCGGAGGA GCTGGTCCAG AAGAACGTGG TCCCCGGCAA CGACCGCAGC
ATGGTGGAGA AGGTGCGCCA GTACGTGGCC AACACCCACT ACTACCGCAA CCGCGAGGAG
GACTGGTTCG CCCCGCAGGT GTTCCTGCGG GCCATAAAGT ACCTGGAGGA CGCGGCCAGG
GCGGGCCAGC CGTTCTTTCT CGTTGTCGAC TCCTTCGACC CGCACGAGCC GTGGGACCCG
CCGGAGAAGT ACGTCGGCCT CTACGGGGAG GAGGGCTATT CGGGGCCCGA GCCCATCGTG
CCGAACTACA GCAAGTCCGA CTACCTGGAG GAGGACGAGC TGCGCCGGAT GCGCGCGCTG
TATGCCGCCG AGGTCACCAT GGCCGACCGG TGGCTGGGCA ACTTCCTGGA CAGGATGGAC
GCTCTGGGCT TTCTCGAGAA CACGCTGCTT TTCGTCCTCT CGGACCACGG GGTCTCCCTG
GGCGAGCACG GCTACACCGG CAAGGTGGAC GAGGCGCTCT GGCCCGAGCT GACCGACATA
ATCTTCTACG TGCGCCACCC GGAGGGCAAG GGGTCCGGCA ACACCAGCGA CTTCTACGCC
TCCATCCACG ACATAGCCCC GACCATCCTC TCGCAGATGG GCATAGAGCC CTGGCAGCCG
ATGGACGGGC AGGATCTGAC GCCCATCCTG GAGGGCAAGG GGCCGCAGCG GGAGCGGGAG
CACTTCACGC TGGGCTACGA CGACTACGCG TGGGCGCGCG ACGAGCGCTA CGCGCTCGTG
TGCCGCAACG ACGGCAGCGA GGCGCGCCTC TACGACCTGC AGAGCGATCC GGGGATGGAC
CGGGACATCT CAGGGAGCCA CCCGGAGGTG GTGAGGAGGA TGTGGGAGGG CTACATCCTC
AAGGACGCCG GCGGTCCCCT GCCCAAGTAC TGA
 
Protein sequence
MSRTLSRKEF LQVAGAGVAG AALLGAGAGC GGSSYLPSGG SRMNVVVVIL DSLRRDHVGA 
YGNDWIQTPT LDALAKDSLR FTRPYPESIP TIPARRAVYT GKRTWPFRNW VPQKGETFFP
AGWQRIPEDQ WSVAEILLDN EYDTVLITDT QHQFKPSMNF HRGFNVFDFI RGQERDRYRP
KQTAPEELVQ KNVVPGNDRS MVEKVRQYVA NTHYYRNREE DWFAPQVFLR AIKYLEDAAR
AGQPFFLVVD SFDPHEPWDP PEKYVGLYGE EGYSGPEPIV PNYSKSDYLE EDELRRMRAL
YAAEVTMADR WLGNFLDRMD ALGFLENTLL FVLSDHGVSL GEHGYTGKVD EALWPELTDI
IFYVRHPEGK GSGNTSDFYA SIHDIAPTIL SQMGIEPWQP MDGQDLTPIL EGKGPQRERE
HFTLGYDDYA WARDERYALV CRNDGSEARL YDLQSDPGMD RDISGSHPEV VRRMWEGYIL
KDAGGPLPKY