Gene RPB_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0032 
Symbol 
ID3909715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp30131 
End bp32929 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content67% 
IMG OID637881913 
ProductPII uridylyl-transferase 
Protein accessionYP_483655 
Protein GI86747159 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.968292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.957944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAG TCGTGACACC TCAACGGCCG GCCGCCGACG AGCGGTTCGA TACAACGCGG 
ATCACCGCCG ACATCGCGGC GCTGGCCGGG CAGCATGCCG GCAACGAGCC GACGTTCCGC
ACCGCGCTCG CGCAACTCAT GAAGGCCGAA CTCGCCAAGG CCCGCGCGGC GGCCGAGGCG
CAGTTGCTGC GTGACCGCCA CGGCCGGCGC TGTGCCGAGC ATCTGTGCTT CGTCCAGGAC
GAGATCATCC GGCTGCTGTT CATGGCGGCG ACTCAATATC TTTACAATTC GCCGACGCCG
TCGAGCTCGG AGCGGATGTC GGTGGTCGCC ACCGGCGGCT ACGGCCGCGG CCTGATGGCG
CCGGAGAGCG ACATCGATCT GCTGTTCATC CTGCCCTACA AGCAGACCGC CTGGGGCGAG
CAGGTCGCCG AGGTGATCCT GTATTGCCTG TGGGACATCG GTCTCAAGGT CGGCCACGCC
ACCCGCTCGG TCGACGAGTC GATCCGTCAG GCCCGGGCCG ACATGACGAT CCGCACCGCG
ATCCTGGAAA CCCGCTTCCT GACCGGCGAC AAGGCGCTGT ATCAGGAACT GGTCGACCGC
TTCGACAAGG AAGTGGTCGA GGGCACCGCC GCCGAATTCG TCACCGCCAA GCTCGCCGAG
CGCGAGGAGC GGCATCGCCG CTCCGGCCAG TCGCGCTATC TGGTCGAGCC CAACGTCAAG
GACGGCAAGG GCGGCTTGCG CGACCTGCAC ACGCTGTTCT GGATCGCCAA ATACGTCTAC
CGCGTTCGCG GCACCAGCGA ACTGCTCGAA CACGGCGTGT TCGACCAGAT CGAATTCCGC
ACCTTCCGCC GCTGCGAGGA TTTCCTCTGG TCGGTGCGCT GCAATCTGCA TTTCGTGACG
CGGCGCGCCG AGGAGCGGCT GTCGTTCGAC CTGCAGCGCG AGATCGGCGT GCGGCTCGGC
TACACCTCGC ATCCCGGGAT GCAGGACGTC GAGCGCTTCA TGAAGCACTA CTTCCTGATC
GCCAAGAAGG TCGGCAATCT GACCGCTATC CTGTGCGCCA AGCTCGAGGA CCAGCAGGCC
AAGCCGGCGC CGGCGCTGTC GCGGATGATG GCGCGGCTGC GGCCCGGCGC GACGCGACGC
CGGGTACCCG AGAGCGACGA CTTCGTCATC GACAACAACC GCATCAATTG CGCGATGCCG
GACGTGTTCA AGCACGACCC GGTCAATCTG ATCCGGATCT TCCGGTTGGC GCAGAAGAAC
AACCTCGCCT TCCATCCCGA CGCGATGCGG ACGGTGACGC GCTCGCTGGC ACTGATCAAC
GCCCAGCTCC GCGACGATCC CGAGGCCAAT CGACTGTTCA TCGAGATCCT GACCTCGGAC
AATGCCGAAC AGGTGCTGCG GCGGATGAAC GAGACCGGCG TGCTCGGCCG CTTCATCCGC
GCCTTCGGCC GCATCGTCTC GATGATGCAG TTCAACATGT ATCACAGCTA CACGGTGGAC
GAGCATCTGC TGCGCTGTGT CGGCAACCTC CAGGAGATCG AGCGCGGCGG CAACGACGAG
TTCATCCTGT CGTCCGACCT GATCCGCAAG ATCCGCCCCG ACCACCGTGC GGTGCTGTAT
GTCGCGGTGC TGCTGCACGA CATCGCCAAG GGCCAGCCGG AGGACCACTC CACCGCCGGC
GCCAAGGTGG CGCGGCGGCT GTGCCCGCGG TTCGGCTTCA ACGCCGCCGA CACCGAACTG
ATCGCCTGGC TGATCGACAA GCATCTGGTG ATGTCCACCG TGGCGCAATC ACGCGACCTG
TCGGACCGCA AGACCATCGA GAATTTCGCC GCCGTGGTCG AGTCGGTCGA GCAGATGAAG
CTCTTGACCA TCCTCACCAC CGCCGACATT CGCGGCGTCG GCCCGGGCGT CTGGAACGGC
TGGAAGGCGC AGCTGTTGCG CACGCTGTAT TACGAGACCG AGCCGGTGCT GACCGGCGGC
TTCTCCGAGG TCAACCGCGC CGAGCGCATC CGCGGCGCGC AGGCGGAATT CCGCGCCAAT
TTCACCGAAT GGCCGGCGGC CGAGCTCGAC GCCTACGTGG CGCGGCACTA CCCGGCGTAC
TGGCTCAAGG TCGAGCTGGC GCGCAAGATC CGCCACGCCC GCTTCCTGCG CGCCTCCGAA
CAGGCCGGCC ACAAGCTCGC GGTCAATGTC GGCTTCGACG AGGCGCGCGG CGTCACCGAA
CTGACGATCC TGGCGGTCGA CCATCCCTGG CTGCTGTCGA TCATCGCCGG CGCCTGCGCC
TCGGCCGGCG CCAACATCGT CGACGCCCAG ATCTACACCA CCACCGACGG CCGCGCGCTC
GACACCATCT CGATCAGCCG CGAATACGAT CGCGACGAGG ACGAGGGCCG CCGCGCGACG
CGGATCGGCG AGATGATCGA GGAAGTGCTG GAAGGCAAGC TGCGGCTGCC CGAAGCGGTG
GCGCGGCGCG CCACCAACGG CCGCGCCAAG CTGCGCGCCT TCGTGGTCGA GCCGGAAGTC
TCGATCAATA ACAATTGGTC GGACCGCTAC ACCGTGATCG AGGTCAGTGG CCTCGACCGC
CCCGGCCTGC TGTATCAGCT CACCACCGCG ATCTCGAAGC TGAACCTCAA CATCGCCTCG
GCGCATGTCG CCACCTTCGG CGAACGCGCC CGCGACGTGT TCTACGTCAC CGATCTGCTC
GGCGCCCAGA TCACCGCGCC GACCCGGCAG GCCGCGATCA AGCGCGCGCT GGTTCACCTG
CTCGCCAATG GCGACGCGGA GCAACAGCCG GCGGCGTGA
 
Protein sequence
MDKVVTPQRP AADERFDTTR ITADIAALAG QHAGNEPTFR TALAQLMKAE LAKARAAAEA 
QLLRDRHGRR CAEHLCFVQD EIIRLLFMAA TQYLYNSPTP SSSERMSVVA TGGYGRGLMA
PESDIDLLFI LPYKQTAWGE QVAEVILYCL WDIGLKVGHA TRSVDESIRQ ARADMTIRTA
ILETRFLTGD KALYQELVDR FDKEVVEGTA AEFVTAKLAE REERHRRSGQ SRYLVEPNVK
DGKGGLRDLH TLFWIAKYVY RVRGTSELLE HGVFDQIEFR TFRRCEDFLW SVRCNLHFVT
RRAEERLSFD LQREIGVRLG YTSHPGMQDV ERFMKHYFLI AKKVGNLTAI LCAKLEDQQA
KPAPALSRMM ARLRPGATRR RVPESDDFVI DNNRINCAMP DVFKHDPVNL IRIFRLAQKN
NLAFHPDAMR TVTRSLALIN AQLRDDPEAN RLFIEILTSD NAEQVLRRMN ETGVLGRFIR
AFGRIVSMMQ FNMYHSYTVD EHLLRCVGNL QEIERGGNDE FILSSDLIRK IRPDHRAVLY
VAVLLHDIAK GQPEDHSTAG AKVARRLCPR FGFNAADTEL IAWLIDKHLV MSTVAQSRDL
SDRKTIENFA AVVESVEQMK LLTILTTADI RGVGPGVWNG WKAQLLRTLY YETEPVLTGG
FSEVNRAERI RGAQAEFRAN FTEWPAAELD AYVARHYPAY WLKVELARKI RHARFLRASE
QAGHKLAVNV GFDEARGVTE LTILAVDHPW LLSIIAGACA SAGANIVDAQ IYTTTDGRAL
DTISISREYD RDEDEGRRAT RIGEMIEEVL EGKLRLPEAV ARRATNGRAK LRAFVVEPEV
SINNNWSDRY TVIEVSGLDR PGLLYQLTTA ISKLNLNIAS AHVATFGERA RDVFYVTDLL
GAQITAPTRQ AAIKRALVHL LANGDAEQQP AA