Gene RPB_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1043 
Symbol 
ID3908895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1199864 
End bp1200739 
Gene Length876 bp 
Protein Length291 aa 
Translation table11 
GC content66% 
IMG OID637882936 
Productsulfate adenylyltransferase subunit 2 
Protein accessionYP_484664 
Protein GI86748168 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes 
TIGRFAM ID[TIGR02039] sulfate adenylyltransferase, small subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.582671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCG CCGCCCGCGA CCCCCTCGCC GCCAAGCCGG TGTCGTTCGC CGATCTCGAC 
CCGCGCGAAC AGCTCCGCCC GATGGATCAT CTCGACGCGC TGGAAGCGCA GAGCATCTAC
ATATTCCGTG AAGGTTTTGC GCGGCTGAAG AAGCTGGCAC TGCTGTGGTC GCTCGGCAAA
GATTCCAACG TGATGATCTG GCTGGCGCGC AAGGCGTTCT TCGGCAAGGT GCCGTTCCCG
GCGCTGCACG TCGACACCGG CAAGAAGTTT CCTGAGATGT ACGCCTTCCG CGAACACTAC
GCGAAGGAGT GGGATCTCGA TTTGCGCGTC GATCCCTGCC CGCCGATCGA CAGCGTCGAT
CCGACCCTGC CGCCGGCGGC GCGCTCGGCG GCGCGCAAGA CCGAAGGCTT GAAGCTGGCG
CTGGCCAAAT ACGGCTTCGA CGGACTGATC GCCGGCATCC GCCGCGACGA GGAGGCGACC
CGCGCCAAAG AACGCGTGTT CTCGCCGCGC GGCACCGAGG GCGGCTGGGA CGTGCGCGAT
CAGCCGCCGG AATTCTGGGA CCAGTTCAAC GCCTCGCCGC CGCCCGGCGC TCACTTGCGT
ATTCACCCCA TCCTGCATTG GACCGAGGCC GACATCTGGG CCTACACCAA GCGCGAGAAC
ATCCCGATCA TCCCGCTGTA TCTGGCCAAG GACGGCAAGC GCTATCGCTC GCTCGGCGAC
CAGGACATCA CCTTCCCGGT GGCGTCGCAC GCCTCGTCGA TCGACGAGAT CCTGCACGAA
TTGCAGACCA CCAAGGTGCC GGAGCGCGCC GGCCGCGCGC TCGACCACGA GACCGAGGAC
GCGTTCGAGC GGCTTCGCGT CGCCGGTTAT CTGTGA
 
Protein sequence
MTLAARDPLA AKPVSFADLD PREQLRPMDH LDALEAQSIY IFREGFARLK KLALLWSLGK 
DSNVMIWLAR KAFFGKVPFP ALHVDTGKKF PEMYAFREHY AKEWDLDLRV DPCPPIDSVD
PTLPPAARSA ARKTEGLKLA LAKYGFDGLI AGIRRDEEAT RAKERVFSPR GTEGGWDVRD
QPPEFWDQFN ASPPPGAHLR IHPILHWTEA DIWAYTKREN IPIIPLYLAK DGKRYRSLGD
QDITFPVASH ASSIDEILHE LQTTKVPERA GRALDHETED AFERLRVAGY L