Gene Rcas_4308 details

Gene Information

Locus tag: Rcas_4308
Symbol:
ID: 5541819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5558266
End bp: 5559975
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 63%
IMG OID: 640896414
Product: bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein
Protein accession: YP_001434352
Protein GI: 156744223
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
        [COG2046] ATP sulfurylase (sulfate adenylyltransferase)
TIGRFAM ID: [TIGR00339] ATP sulphurylase
            [TIGR00455] adenylylsulfate kinase (apsK)
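
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5558266
end_bp = 5559975
gene_length_bp = 1710
protein_length_aa = 569

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3

print(span == gene_length_bp)            # coordinates agree with gene length
print(gene_length_bp % 3 == 0)           # CDS is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop equals protein length
```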


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTCA TTCCGCCATA CGGTGGTCGT CTGATCAATC TGCTGGTCTC TGGCGAGGAA 
CGTCGTACCC TGATTGAAGA AGCGGCGCGA CTCCCCTCGA TCCAGATTTC AGCGCGTGCG
CTCTGCGACC TCGAAGTGCT GGCGACCGGC GGTTTTTCGC CGCTCGACCG GTTTATGGGG
CGCGCCGACT ACGAATGTGT GCTGCACGAA ATGCGACTGG CAGACGGCAC GCTCTTCCCG
TTGCCGATCA CACTACCGGT CGATGGAAAG ACGCTGGCGC GCCTCGGTGA TCGAATTGCA
CTGCGCGACG CGCGCAATGA ACTGATCGCT GTGATGAATA TCGAAGAGGC ATTTGCATGG
GACGCCGGTC AGGAAGCGCG CCTGACACTC GGCACGACCG ATCCGCGCCA TCCGCTTGTG
TCGGAAATGA GCATGTGGGG CGATACGTAC ATTTCCGGCG CGTTGCAGGT TGTGCGCCTG
CCGCGTTACT ACGATTTCGT GGAACTGCGG CGCACTCCGG CTGAGGTGCG TTCGATCCTG
CACGAAATGG GGGCGGAACG GGTCATCGCT TTTCAGACGC GTAATCCGCT GCACCGCGTT
CACGAAGAAC TGACAAAGCG CGCGGCTGCC GAAGTTGACG GCGCGTTGCT CATCCATCCG
GTCGTCGGGC TGACTCGTCC CGGCGACATC GACCATTACA GTCGGGTGCG AATTTACCGC
GCGCTGGTCG AGCGGTACTA TGATCCACAA CGCACACTGT TGAGCCTCCT ACCGCTGGCG
ATGCGTATGG CCGGACCACG CGAGGCGCTC TGGCACGCAA TCATCCGGCG TAACTTCGGC
GCGACCCACT TCATCGTCGG GCGCGACCAT GCCGGTCCGG GTCTCGATAG CCGTGGCAAG
CCGTTCTATG GTCCCTACGA TGCCCAGGAA CTGGTGGCGC GTCACACCGA TGAAATCGGC
GTTGCGATGG TGCCGTTCCG CGAGTACGTC TACCTGCCCG ACGCCAATGA GTATGTTGAA
GAAACCGCCG TTCCGCCGGG TGCGCGCGTC TGGACAATTT CGGGCACGCA GGTACGAGAT
GAGTATCTGG CGAAAGGCAA ACTGCTGCCG GAATGGTTCA CCCGCCCGGA AACGGCGGCG
ATCCTGGCGC AGAGTTACCC GCCGCGTCAT CGTCAGGGGT TTTGCATCTG GTTCACCGGT
CTCAGCGGCG CGGGCAAATC GACGATTGCC GAGGCGCTGG TGGCGATGCT GTTGGAGCGG
GGACGCCAGA GTACGCTGCT CGATGGCGAT GTGGTGCGCA CGCACCTGTC GAAGGGGCTT
GGGTTCAGCC GCGAAGATCG TGACACGAAC ATTCTGCGGA TCGGATTCGT CGCCGGTGAA
ATCGTGCGAC ACGGCGGCGT AGCGATCTGC GCTGCGATCA GCCCGTACCG CGCAGCGCGT
AACGAGTGCC GCAAGATGGT CGGAGATGAT CGCTTCTTCG AGGTGTTTGT CGATACGCCG
ATCGAAATCT GCGAACGGCG CGACACGAAA GGCATGTACG CCCGCGCCCG CCGTGGCGAG
ATCACCGGCT TCACCGGCAT TGACGACCCC TACGAGCCTC CGGCGGCGCC GGAAGTGCAC
CTCACGACAG TCGATACCAC GCCCGACGAG TGCGCGCGGC GCATCGTGGC CCTGCTGGAG
GAGCGCGGCT TTCTGACGCG ATCGGGCTAA
 
Protein sequence
MSLIPPYGGR LINLLVSGEE RRTLIEEAAR LPSIQISARA LCDLEVLATG GFSPLDRFMG 
RADYECVLHE MRLADGTLFP LPITLPVDGK TLARLGDRIA LRDARNELIA VMNIEEAFAW
DAGQEARLTL GTTDPRHPLV SEMSMWGDTY ISGALQVVRL PRYYDFVELR RTPAEVRSIL
HEMGAERVIA FQTRNPLHRV HEELTKRAAA EVDGALLIHP VVGLTRPGDI DHYSRVRIYR
ALVERYYDPQ RTLLSLLPLA MRMAGPREAL WHAIIRRNFG ATHFIVGRDH AGPGLDSRGK
PFYGPYDAQE LVARHTDEIG VAMVPFREYV YLPDANEYVE ETAVPPGARV WTISGTQVRD
EYLAKGKLLP EWFTRPETAA ILAQSYPPRH RQGFCIWFTG LSGAGKSTIA EALVAMLLER
GRQSTLLDGD VVRTHLSKGL GFSREDRDTN ILRIGFVAGE IVRHGGVAIC AAISPYRAAR
NECRKMVGDD RFFEVFVDTP IEICERRDTK GMYARARRGE ITGFTGIDDP YEPPAAPEVH
LTTVDTTPDE CARRIVALLE ERGFLTRSG
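
The sequence blocks above are laid out in groups of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon map is the standard codon-to-amino-acid assignment, which translation table 11 shares (table 11 differs in which codons may serve as starts). Only the first and last rows of the gene sequence are used here for brevity.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above (not the full gene).
first_row = clean("ATGTCACTCA TTCCGCCATA CGGTGGTCGT CTGATCAATC TGCTGGTCTC TGGCGAGGAA")
last_row = clean("GAGCGCGGCT TTCTGACGCG ATCGGGCTAA")

print(translate(first_row))  # MSLIPPYGGRLINLLVSGEE
print(translate(last_row))   # ERGFLTRSG
```

Both translated fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.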