Gene RPB_1042 details

Gene Information

Locus tag: RPB_1042
Symbol:
ID: 3908894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1197904
End bp: 1199832
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 71%
IMG OID: 637882935
Product: adenylylsulfate kinase
Protein accession: YP_484663
Protein GI: 86748167
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0529] Adenylylsulfate kinase and related kinases
        [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00455] adenylylsulfate kinase (apsK)
            [TIGR00485] translation elongation factor TU
            [TIGR02034] sulfate adenylyltransferase, large subunit
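
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. The constant and function names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
START_BP = 1197904
END_BP = 1199832
GENE_LENGTH_BP = 1929
PROTEIN_LENGTH_AA = 642


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1199832 - 1197904 + 1 gives 1929 bp, and 1929 / 3 - 1 gives 642 aa.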


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACG CCATGACCAT CGCCGTCCCG CCCGCCACGC TCTCCACGCC GAACGGCACC 
ACGCGCCCCT TGGTGCGCAT CGTCATCGTC GGCCATGTCG ACCACGGCAA ATCGACGCTG
GTCGGCCGGC TGCTGCACGA GACCGGCTCG CTGCCGGACG GCAAGCTGGA GATGCTGAAA
GCGGTCAGCG CCCGGCGCGG CATGCCGTTC GAATGGTCGT TCCTGCTGGA TGCGCTGCAG
ACCGAGCGCG ATCAGGGCAT CACCATCGAC ACCACGCAGA TCTCCTTCCG CACCCGCTCG
CGCGACGTCG TGCTGATCGA CGCGCCCGGC CACGCCGAAT TCCTGCGCAA CATGATCACC
GGCGCCTCGC AGGCCGACGG CGCGGTGCTG ATCATCGACG CGCTCGAAGG CGTCCGCGAC
CAGACGCGGC GGCACGGCTA TCTGTTGCAA CTGCTCGGCA TCAAGCAGGT CGCGGTGGTG
GTCAACAAGA TGGACCGCGT CGACTTTTCG GCCGAGCGGT TCGGCGCGAT CAGCGACGAA
ATCAGCGCAC ATCTGATCGG GCTCGGCGTG ACCCCGACGG CGGTGATCCC GATCTCGGCG
CGCGACGGCG ACGGCGTCGC CGAGCGCACC GCCAACATCG CCTGGTATGT CGGCCCGACC
GTGGTCGAGG CGCTGGATGC GCTGACGCCG TCGCAGCCGC TCGATCTGCT CGCGCTGCGG
CTACCGGTGC AGGCGATCTA CAAATTCGAC GACCGCCGCA TCGTCGCCGG CCGCATCGAA
TCCGGCCGGC TGCAGGCGGG CGACGACATC GTGATCATGC CGGCCGGCAA GATCGCGAAG
ATCAAATCGG TCGAGAGCTG GCCGGTGACG CCGGTCGAAG GACCGCAGGG CGCCGGCCGC
TCGGTCGGCA TCACGCTCGA CCGCGAATTG TTCGTCGAAC GCGGCGACGT CATCGGCCAT
GTCGGCGCCG CGCCGCGCGA TACGCGGCGG CTGCGCGCGC GGATCTTCTG GCTGCACGAT
CAGCCGCTCG AAGCCGGCGC CTCGCTGCTG GTGCGGCTCG GCACCCGCGA GACCCGCGCC
ACCGTGGTGG CGATCGAGAA GGCGGTCGAT CCGGGCGAAC TCGCCAGCGA GGCCGCGACC
GCGATCCGGC GCAATCATGT CGGGGAGATC GACCTGTCGC TGGCGCAGCC GGTGGCGGCC
GATCCGGCCG AGGCGTTTCC GCGCAGCGGC CGGCTGGTGA TCGAAATCGG CGGCCGGATC
GCCGGCGGCG GGCTGGTGCT CAGCGTCGAC TCGGGCCAGC GCGCCGCGAG CGCCGACATC
GTGCCGGTGG ACTCCGCGCT GCGGCCGGAC GAACGCGTCG CGCGCTTCCG CCACAATGGC
GCGGTGCTGT GGTTCACCGG CCTGCCGGGC TCCGGCAAAT CGACGCTGGC CAAGGCGATC
GAGCGCCGGC TGTTCGACCG CGGCGGCTCG CCGATCCTAC TCGACGGCGA CACGCTGCGC
GCCGGGCTCA ACAGCGACCT CGGCTTTGCG CCGCACGACC GCGCCGAGAA CATCCGCCGC
CTCGCCGAGA TCGCCGCGCA TCTCGCCACC AACGGCCACA TCGCCATCGT CGCCGCGGTG
TCGCCGGCGC TGGACGATCG CGCCGCCGCC CGCCGCATCG CCGGCGACCG CTTCCGCGAA
ATCCATGTCG CGACGCCGGC CGAAATCTGC GAGCAGCGCG ACCCCAAGGG CCACTACGCG
AAGGCCCGCG CCGGCGCGCT GCAGAACTTC ACCGGGATCG GCAACGACTA TCAGCCGCCG
ACCGCGAGCG AACTGGTGCT CGACACCTCG GCGCAGACCG TGGCCGAAGC CGCCGACACG
ATCGAGGCGA TGCTGGCGCG CAGCGGCGTG CTGGTCGACG AACTGGTCGA TCTCGCCGCG
AACATCTGA
 
Protein sequence
MSDAMTIAVP PATLSTPNGT TRPLVRIVIV GHVDHGKSTL VGRLLHETGS LPDGKLEMLK 
AVSARRGMPF EWSFLLDALQ TERDQGITID TTQISFRTRS RDVVLIDAPG HAEFLRNMIT
GASQADGAVL IIDALEGVRD QTRRHGYLLQ LLGIKQVAVV VNKMDRVDFS AERFGAISDE
ISAHLIGLGV TPTAVIPISA RDGDGVAERT ANIAWYVGPT VVEALDALTP SQPLDLLALR
LPVQAIYKFD DRRIVAGRIE SGRLQAGDDI VIMPAGKIAK IKSVESWPVT PVEGPQGAGR
SVGITLDREL FVERGDVIGH VGAAPRDTRR LRARIFWLHD QPLEAGASLL VRLGTRETRA
TVVAIEKAVD PGELASEAAT AIRRNHVGEI DLSLAQPVAA DPAEAFPRSG RLVIEIGGRI
AGGGLVLSVD SGQRAASADI VPVDSALRPD ERVARFRHNG AVLWFTGLPG SGKSTLAKAI
ERRLFDRGGS PILLDGDTLR AGLNSDLGFA PHDRAENIRR LAEIAAHLAT NGHIAIVAAV
SPALDDRAAA RRIAGDRFRE IHVATPAEIC EQRDPKGHYA KARAGALQNF TGIGNDYQPP
TASELVLDTS AQTVAEAADT IEAMLARSGV LVDELVDLAA NI
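
The gene and protein sequences above can be related to each other with a short script. The following is a minimal sketch and not the tool used to produce this record. It builds the codon assignments shared by the standard code and translation table 11, strips the spacing used in the 10-base blocks, and translates until the first stop codon. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
# Codon assignments in NCBI order (bases T, C, A, G); the amino acid
# assignments are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGAGTGACG CCATGACCAT CGCCGTCCCG "
        "CCCGCCACGC TCTCCACGCC GAACGGCACC"
    )
    print(translate(first_line))  # MSDAMTIAVPPATLSTPNGT
```

The 20 residues printed for the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.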