Gene Rxyl_0966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0966 
Symbolsat 
ID4115929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp1003848 
End bp1005029 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content69% 
IMG OID638035751 
Productsulfate adenylyltransferase 
Protein accessionYP_643745 
Protein GI108803808 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2046] ATP sulfurylase (sulfate adenylyltransferase) 
TIGRFAM ID[TIGR00339] ATP sulphurylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.399094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGCA CGGAGTACAC CACCATAACC CCGCACGGCG GCACCCTCGT GGACCGGCGG 
GTGCCGGTGG GCGAGCGCGA GGAGCGCAGG CAGCGGGCGG CGGAGCTGCC GCGGATAGTC
CTCGGGCCGC GCAACCTCTC GGACCTGGAG ATGATCGGGA CCGGCGTCTT CTCCCCCCTC
ACCGGCTTTA TGGGGCGGGA GGACTACGAG AGCGTCGTGG AGGAGATGCG GCTCGCCGAC
GGGCTGCCGT GGAGCATCCC GATCACGCTC TCCGTCTCCG AGGAGGAGGC CCGCTCCTTC
GAGGAGGGGG ACGAGGTGGC GCTCGCCAAC GGCGAGGGCG AGATCGTGGC CACCATGGTG
GTGGAGGACC GCTACACCTA CGACCGGGCC CACGAGGCCA AGCTCGTCTA CAGGACCACC
GACACCGACC ACCCGGGGGT GGCCGCCCTG TTCAGGCAGG GGGACGTGCT GGTGGGCGGC
GAAGTCTCGC TGCTCGACGA CGGGACCACC ACCCGGCCCT TCCCCCGCTA CTACTACGAG
CCGCGGGAGC TGCGGGCCAT CTTCCGCCAG AAGGGCTGGC GGCGGGTGGT GGGCTTCCAG
ACCCGCAACC CCGTCCACCG CGCCCACGAG TACATCCAGA AGAGCGCGCT GGAGACCGTG
GACGGCCTGC TTTTGAACCC GCTCGTCGGC GAGACCAAGT CCGACGACAT CCCGGCCCAT
GTCCGGATGC GCTCCTACGA GGTGCTGCTG GAGCGCTACT ACCCGCGGGA CCGGACCGTG
CTCGCCGTCT TCCCGGCGGC CATGCGCTAC GCCGGGCCGC GGGAGGCCGT CTTCCACGCC
ATCTGCCGCA AGAACTACGG CTGCACCCAC TTTATCGTGG GGCGGGACCA CGCCGGGGTG
GGCAACTACT ACGGCACCTA CGACGCCCAC CGCATCTTCG ACGAGTTCGA GCCCGGCGAG
CTCGGCATAA CCCCGCTGTT CTTCGAGCAC GCCTTCTTCT GCCTCAACTG CGGCGGGATG
GCGACGACCA AGACCTGCCC GCACGACAAG GACTCCCACG TCTTCTTCTC GGGCACCCGG
GTGCGGGAGA TGCTGCGCAA CGGCGAGTAC CCGCCGCCGG AGTTCTCCCG GCCCGAGGTT
ATAGAGGTGC TGATCTCGGG GCTCAGGCAA CAGGAGGGAT GA
 
Protein sequence
MMRTEYTTIT PHGGTLVDRR VPVGEREERR QRAAELPRIV LGPRNLSDLE MIGTGVFSPL 
TGFMGREDYE SVVEEMRLAD GLPWSIPITL SVSEEEARSF EEGDEVALAN GEGEIVATMV
VEDRYTYDRA HEAKLVYRTT DTDHPGVAAL FRQGDVLVGG EVSLLDDGTT TRPFPRYYYE
PRELRAIFRQ KGWRRVVGFQ TRNPVHRAHE YIQKSALETV DGLLLNPLVG ETKSDDIPAH
VRMRSYEVLL ERYYPRDRTV LAVFPAAMRY AGPREAVFHA ICRKNYGCTH FIVGRDHAGV
GNYYGTYDAH RIFDEFEPGE LGITPLFFEH AFFCLNCGGM ATTKTCPHDK DSHVFFSGTR
VREMLRNGEY PPPEFSRPEV IEVLISGLRQ QEG