Gene RoseRS_3336 details


Gene Information

Locus tag: RoseRS_3336
Symbol:
ID: 5210313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4184642
End bp: 4186378
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 63%
IMG OID: 640596934
Product: bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein
Protein accession: YP_001277647
Protein GI: 148657442
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
        [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
            [TIGR00455] adenylylsulfate kinase (apsK)
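
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions, not statements from the record itself.

```python
# Values copied from the gene information fields of this record.
start_bp = 4184642
end_bp = 4186378

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1737
print(protein_length_aa)   # 578
```

Under these assumptions the computed values agree with the reported gene length (1737 bp) and protein length (578 aa).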


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.442858
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTGA TTCCGCCATA TGGCGGTCGA TTGATCAATC TGCTGGTTGC CAGCGAGGAA 
CGACGCGCGC TGCTCGAAGA GGCGGCGCGA CTCCCCTCGA TCCAGATCTC ACCACGGTCT
TTGTGCGATC TTGAACTGCT GGCAACAGGC GGTTTTTCGC CGCTCGACCG TTTTATGGGG
CGTGCTGATT ATGAGCGCGT CCTGCACGAC ATGCGACTGG CGGGAGGGAC GCTCTTCCCA
CTGCCGATCA CGCTGCCGGT CTCTGGCAAG ACGCTGGCGC GTTCTGGCGA TCGTGTTGCG
CTGCGCGATG CGCGAAACGA ACTGATCGCT GTGATGGATG TCGAGGAAGC CTTCACCTGG
AATGCCGAAG AGGAAGCGCG ATTGACGCTC GGCACAACCG ATCCGCGCCA TCCGCTGGTT
TCGGAAATGA GCACGTGGGG CGACACGTAC ATCTCAGGCG CGCTGCGCGT TGTCCGCCTG
CCCCGCTACT ACGATTTCGT CGAACTGCGG CGCACCCCTG CCGAGGTTCG TTCCATTCTC
CACGAGATGG GCGCGGAGCG GGTGGTCGCT TTCCAGACGC GCAACCCGCT CCACCGTGTT
CATGAGGAAT TGACGAAGCG TGCCGCAGCA GAAGTTGGCG GTGCGTTGCT CATCCATCCG
GTTGTGGGAT TGACCCGTCC CGGTGACATC GACCATTACA GCCGGGTGCG CATCTATCGG
GCGCTCGTGG AGCGGTACTA CGATCCGCGC CGCACGCTGC TGAGCCTCCT GCCGCTGGCG
ATGCGCATGG CAGGTCCGCG TGAGGCGCTC TGGCACGCAA TCATTCGGCG CAACTTCGGC
GCAACCCATT TCATCGTCGG GCGCGACCAT GCCGGTCCCG GTCTCGACAG CCGCGGCAAG
CCGTTCTACG GACCATACGA TGCCCAGGAA CTGGTGGCGC GCTACGCAAA TGAGATCGGC
GTGACGATGG TTCCGTTCCG CGAGTATGTC TACCTCCCGG ACACAGACCA GTACGTTGAA
GAGACCGCCG TGCCGCAGGG AGCGCGCGTC TGGACGATTT CGGGGACGCA GGTGCGTGAA
GAATATCTGG CGCGCGGCAA ACGGCTGCCG GAATGGTTTA CCCGCCCGGA AACGGCGGCA
ATCCTGGCGC AGAGTTACCC GCCGCGGCAC CGCCAGGGGT TCTGTGTCTG GTTCACCGGT
CTCAGCGGCG CCGGCAAATC GACAATCGCC GAGGCGCTGG TGGCGATGTT GCTGGAACGT
GGACGCCAGA GCACACTGCT CGATGGTGAT GTGGTGCGCA CACATCTGTC GAAGGGGCTT
GGCTTCAGCC GGGAGGATCG TGATACGAAC ATTTTGCGGA TCGGATTCGT TGCCGGTGAA
ATCGCGCGAC ACGGCGGCGT CGCGATCTGC GCTGCAATCA GTCCCTACCG TGCTGCGCGG
AACGAGTGCC GCAAAATGGT CGGTGAAGAC CGCTTCTTCG AGGTGTTCGT CGATACGCCG
ATTGAAGAGT GCGAGCGACG TGACACCAAA GGCATGTACG CCCGCGCCCG TCGTGGCGAA
ATCACCGGGT TCACCGGCAT CGACGATCCT TACGAGCCGC CGGTGGCGCC GGAAGTGCAC
CTGACGACCG TCGATACCAC GCCCGAAGAA TGCGCGCGGC GGATTATCGC CCTGCTGGAG
GAACGCGGCT TTCTGACCCG ACCGGATCAG GATGGCGTTT CGGGTGCAAC CGGGTAA
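
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. A minimal sketch of how the GC content could be recomputed from it follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute or round its reported 63% differently.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGCCACTGA TTCCGCCATA TGGCGGTCGA TTGATCAATC TGCTGGTTGC CAGCGAGGAA"
print(len(clean_sequence(first_line)))  # 60 bases: six blocks of ten
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.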
 
Protein sequence
MPLIPPYGGR LINLLVASEE RRALLEEAAR LPSIQISPRS LCDLELLATG GFSPLDRFMG 
RADYERVLHD MRLAGGTLFP LPITLPVSGK TLARSGDRVA LRDARNELIA VMDVEEAFTW
NAEEEARLTL GTTDPRHPLV SEMSTWGDTY ISGALRVVRL PRYYDFVELR RTPAEVRSIL
HEMGAERVVA FQTRNPLHRV HEELTKRAAA EVGGALLIHP VVGLTRPGDI DHYSRVRIYR
ALVERYYDPR RTLLSLLPLA MRMAGPREAL WHAIIRRNFG ATHFIVGRDH AGPGLDSRGK
PFYGPYDAQE LVARYANEIG VTMVPFREYV YLPDTDQYVE ETAVPQGARV WTISGTQVRE
EYLARGKRLP EWFTRPETAA ILAQSYPPRH RQGFCVWFTG LSGAGKSTIA EALVAMLLER
GRQSTLLDGD VVRTHLSKGL GFSREDRDTN ILRIGFVAGE IARHGGVAIC AAISPYRAAR
NECRKMVGED RFFEVFVDTP IEECERRDTK GMYARARRGE ITGFTGIDDP YEPPVAPEVH
LTTVDTTPEE CARRIIALLE ERGFLTRPDQ DGVSGATG
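
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds the codon table from the compact amino-acid string of the standard genetic code, whose codon assignments are the same in translation table 11; the name `translate` is hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle. That does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence in this record.
print(translate("ATGCCACTGA TTCCGCCATA TGGCGGTCGA"))  # MPLIPPYGGR
print(translate("GATGGCGTTT CGGGTGCAAC CGGGTAA"))     # DGVSGATG
```

Both fragments match the start and end of the protein sequence listed above.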