Gene RPC_0064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0064 
Symbol 
ID3971397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp72627 
End bp74480 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content70% 
IMG OID637923180 
ProductSulfate adenylyltransferase, large subunit 
Protein accessionYP_529962 
Protein GI90421592 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTGCC TCGACCACGC CCACGACACC GAGCTCGCCG CGCCGCTAAA ACATCCCGTC 
GATCGGCCGA CGCTGCGCTT CATCACCTGC GGCAGCGTCG ACGACGGCAA GAGCACGCTG
GTCGGCCGGC TGCTGTATGA TTCCAAGCTG CTGCTCGACG ACCAGCTCAG CGCGCTGCAC
GCCGAGAGCA AGAGCGCCGG CACCGCCGGC CAGGATCTCG ATTTCGCGCT GCTGGTAGAC
GGCCTGCAGG CCGAGCGCGA GCAAGGCATC ACCATCGACG TCGCTTATCG CTTCTTCGCC
ACGCCGCAGC GCCGCTTCGT GGTCGCCGAC ACCCCGGGGC ACGTGCAATA CACCCGCAAT
ATGGCGACCG GCGCCTCCAC CGCCGACCTC GCCGTGGTGC TGATCGACGC CCGCAAGGGC
GTGATCGATC AGACCCGCCG GCACAGCCAC ATCGTCGGAC TGTTCGGCAT CCGCCACGTC
GTGCTGGCGA TCAACAAGAT GGACCTGGTC GGCTTCGACG CCGCGCGCTT CATCGCCATC
ACCCACGCCT ATCGCGCGCT CGCCGCCGAA CTCGGCATCA CCAACGTCTG CTACGTGCCG
GTGGTGGCGC CGGACGGCGA CAACATCTTC ACGCCGAGCG CGCGGATGCC CTGGTACAGC
GGCCCGACCA TCATGGAGCA TCTCGAAGCG GTGGAGGTCG GCGGCGAGAT CCGCGAACGT
CCGTTCCGCA TGCCGGTGCA ATGGGTCAAC CGGCCGAACG CCGAGTTCCG CGGCTTCAGC
GGCCGCATCG CCTCGGGCCG CGTTGCGCGC GGCGACGCGA TCATGGTGCA GCCCTCGGAC
CGCAGCAGCC ATATCGCGCG GATTGTGACG CCGGCCGGCG AGCGCGACGT CGCGGTGGCC
GGGCAATCGG TGACGCTGCT GCTCGCCGAC GAGATCGACA TCAGCCGTGG CGACGTGGTG
AGTTCGGGCG CTGCGGCGAT GGTCTGCGAC CAGCTCGCCG CGCGCCTGGT GTGGTTCGAC
GACGCCGCGC TGGTGCCCGG CCGCCGCTAT CTGTTGAAGA GCGCGAGGTC CTGCGTCGGC
GCGGTGATCT CCGCGCTGAA GCATCGCGTC GCGATCGACA GCATGGCGCA GCAGGCCGCG
ACCACGCTGA ACGCCAACGA GATCGGCGCG GTCGAGCTCT GCCTGGAGCG GCCGCTGGTC
TGCGAGACCT ATCGCGACGA CCGCGAACTC GGCAGCTTCA TCCTGATCGA TCCGCTCAGT
CACAAGACCG CGGCGGCCGG CATCATCGAC GGCGCGTCGC GCCGTGCCGC CAACAACAAC
TGGCCGGCGC AAGATGCCGG CACCGCCGTG CGGGCGCAAA GCAAGCCGCG GCCCTGCGTG
CTGTGGCTGA CCGGGCGCAG CGGCGGCACC ACATCCATCC TTGCTCATCT GCTTGACAAG
CGACTGAACG AATTCGGTCG GCACTGCATC CTGCTCGACG GCGACGCGCT GCGCCACGGC
CTCAACCGCG ATCTCGGCGT CGGCGACGCG GCGCGGATCG AGGGCAGCCG CCGCCTCGCC
GAGATCGCCA AACTGTTCGT CGATGCCGGA CTGATCGCGC TGGTCAAGCC AATCGCGCCG
CCGCCATCCG GAACGGCGCT GGCGCGGGCG CTGTTCTCGG CCGGCGAATT CATCGAGATC
GACGTCGCCG CGCCGGTGCA GGAAACCGCG CGGCGCGATC CGAAGCCGCT GGCGCGCCGG
ACCCGGGCGG CCGAGCTGCC GAAGCGCGCC GACCTCGTGA TCGACGCCGC CACCCTCGCC
GAAGCCGGCT GCGACCGCAT CATCGCCTCT CTGCGCGAGC GCGGCTGCGT TTAG
 
Protein sequence
MLCLDHAHDT ELAAPLKHPV DRPTLRFITC GSVDDGKSTL VGRLLYDSKL LLDDQLSALH 
AESKSAGTAG QDLDFALLVD GLQAEREQGI TIDVAYRFFA TPQRRFVVAD TPGHVQYTRN
MATGASTADL AVVLIDARKG VIDQTRRHSH IVGLFGIRHV VLAINKMDLV GFDAARFIAI
THAYRALAAE LGITNVCYVP VVAPDGDNIF TPSARMPWYS GPTIMEHLEA VEVGGEIRER
PFRMPVQWVN RPNAEFRGFS GRIASGRVAR GDAIMVQPSD RSSHIARIVT PAGERDVAVA
GQSVTLLLAD EIDISRGDVV SSGAAAMVCD QLAARLVWFD DAALVPGRRY LLKSARSCVG
AVISALKHRV AIDSMAQQAA TTLNANEIGA VELCLERPLV CETYRDDREL GSFILIDPLS
HKTAAAGIID GASRRAANNN WPAQDAGTAV RAQSKPRPCV LWLTGRSGGT TSILAHLLDK
RLNEFGRHCI LLDGDALRHG LNRDLGVGDA ARIEGSRRLA EIAKLFVDAG LIALVKPIAP
PPSGTALARA LFSAGEFIEI DVAAPVQETA RRDPKPLARR TRAAELPKRA DLVIDAATLA
EAGCDRIIAS LRERGCV