Gene RPD_0116 details


Gene Information

Locus tag: RPD_0116
Symbol:
ID: 4020572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 129849
End bp: 132650
Gene Length: 2802 bp
Protein Length: 933 aa
Translation table: 11
GC content: 66%
IMG OID: 637960293
Product: PII uridylyl-transferase
Protein accession: YP_567257
Protein GI: 91974598
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2844] UTP:GlnB (protein PII) uridylyltransferase
TIGRFAM ID: [TIGR01693] [Protein-PII] uridylyltransferase
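
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions are consistent with the listed values but are not stated in the record). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(129849, 132650)
print(length, protein_length_aa(length))  # prints: 2802 933
```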


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.870381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAAG TTGCGACACC TCCTCACCGG CCAGCCGGCG ACGAACGGTT CGATACCCAG 
CGGATCACCG CCGAGATCGC GGTGCTGGCC GCGCAGCATG GCGGCAACGA CCAGGCGTTC
CGCAACGCGC TCGCGCAACT GATGAAGGCC GAGCTCGCCA AAGCGCGCGC CGAGGCCGAG
GCGCAGTTGC TGCGCGATCG GCATGGCCGG CGCTGCGCCG AGCGGCTGTG CTTCGTGAAG
GACGAGATCA TCCGCCTGCT GTTCATGGCG GCGACGAAGT ATTTGTATAA TTCGCCGACG
CCGTCGAGTT CGGAGCGGAT GTCGGTGGTG GCGACCGGCG GCTACGGCCG CGGACTGATG
GCGCCGGAAT CCGACATCGA TCTGCTGTTC ATCCTACCCT ACAAGCAGAC CGCCTGGGGC
GAGCAGGTCG CCGAGGTGGT GCTGTACTCG CTGTGGGACA TCGGGCTGAA GGTCGGCCAC
GCCACCCGCT CGGTCGATGA ATGCATCCGG CAGGCGCGGG CCGACATGAC GATCCGCACC
GCAATCCTCG AAACCCGCTT CCTCACCGGC GACAAGGCGC TGTACGCCGA GTTGGTCGAA
CGCTTCGACA AGGAAGTGGT CGAGGGTACC GCCGCCGAAT TCGTCACCGC CAAGCTCGCC
GAGCGCGAGG AGCGCCATCG CCGTTCCGGC CAGTCGCGCT ATCTGGTCGA GCCCAACGTC
AAGGACGGCA AGGGGGGCTT ACGCGATCTG CACACGCTGT TCTGGATCGC GAAGTACGTC
TATCGCGTGC GCGAGACCAG TGAACTGCTC GAACGCGGCG TGTTCGATGC GATCGAATTC
CGCACCTTCC GCCGTTGCGA GGATTTTCTC TGGTCGGTGC GCTGCAATCT GCACTTCCTG
ACCCGGCGCG CCGAAGAGCG ACTGTCGTTC GACCTGCAGC GCGAGATCGG CGTGCGGCTC
GGCTACACCT CGCATCCGGG GATGCAGGAC GTCGAGCGCT TCATGAAGCA CTACTTCCTG
ATCGCCAAGG AAGTCGGCAA TCTGACCGCG ATCCTGTGCG CCAAGCTCGA GGACCAGCAG
GCCAAACCGG CGCCGGCGCT GTCGCGGATG ATGGCGCGGC TGCGGCCGGG CGCGACACGC
CGCCGCGTGC CGGAAAGCGA CGATTTCGTC ATCGACAACA ATCGGATCAA TCTCGCGGCG
CCGGACGTGT TCAAGCACGA CCCGGTCAAT CTGATCCGGA TCTTCCGGCT GGCGCAGAAG
AACAGCCTCG CCTTCCACCC CGACGCGATG CGGACGGTGA CGCGGTCGCT GGCGCTGATC
AACGCGCAGC TCCGCGACGA TCCGGAAGCC AACCGGCTGT TCATCGAGAT CCTGACCTCC
GACAATTCCG AGCCGGTGCT GCGGCGGATG AACGAGACCG GCGTGCTCGG CCGTTTCATC
CGCGCCTTCG GCCGCATCGT CTCGATGATG CAGTTCAACA TGTACCACAG CTACACGGTG
GACGAGCATC TGCTCCGCTG CGTCGGCAAT CTTCAGGAGA TCGAGCGCGG CGGCAACGAC
GAGTTCATGC TCTCCTCCGA CCTGACCCGC AGGATCCGCC CGGACCATCG CGCGGTGCTG
TATGTCGCGG TGCTGCTGCA CGACATCGCC AAGGGCCAGC CCGAGGATCA CTCCACCGCC
GGCGCCAAGG TGGCGCGGCG GCTGTGTCCA CGCTTCGGCT TCAACGCCGC CGACACCGAG
CTGATCGCCT GGCTGATCGA AAAGCATCTG GTGATGTCGA CGGTGGCGCA GTCGCGCGAT
CTGTCGGATC GCAAGACCAT CGAGAACTTC GCCGCCGTGG TCGAATCGGT CGAGCAGATG
AAGCTGCTGA CCATCCTCAC CACCGCCGAC ATCCGCGGCG TCGGCCCCGG CGTCTGGAAC
GGCTGGAAGG CGCAGCTGCT TCGCACGCTG TATTATGAGA CCGAGCCGGT GCTGACCGGC
GGCTTTTCCG AGGTCAACCG CGCCGAGCGT ATCCGCGCCG CGCAGGCGGA GTTCCGCGCC
GCCTTCACCG AATGGCCGGA AGCCGAACTC GACGCCTATG TGGCGCGACA CTACCCGGCC
TATTGGCTCA AGGTCGAGCT GGCGCGCAAG CTGCGCCACG CCCGTTTCCT GCGCGCCTCC
GAACAGGCCG GCAACAAGCT CGCGGTCAAT GTCGGCTTCG ACGAGGCGCG TGGCGTCACC
GAACTGACCA TTCTGGCGGT CGACCATCCC TGGTTGCTGT CGATCATCGC CGGCGCCTGC
GCCTCGGCCG GCGCCAATAT CGTCGACGCC CAGATCTACA CCACCACCGA CGGCCGGGCG
CTCGACACCA TTTCGATCCG CCGCGAATAC GATCGCGACG AGGACGAAGG CCGGCGAGCT
ACGCGGATCG GCGAGATCAT CGAGGAGGTG CTGGAGGGCA AGCTGCGGCT CCCCGAGGCG
GTCGCCCGCC GCGCCACCAG CAGCAAGACC AAGCTTCGCG CTTTCGTGGT CGAACCCGAA
ATCTCGATCA ACAACAATTG GTCGGACCGC TACACCGTGA TCGAGGTGTC CGGCCTCGAC
CGCCCCGGCC TGTTGTATCA GCTCACCACC GCGATCTCGA AGCTGAACCT CAACATCGCC
TCGGCGCATG TCGCGACCTT CGGCGAGCGC GCCCGCGACG TGTTCTACGT CACAGACCTG
CTCGGCGCCC AGATCACCGC GCCGACCAGG CAGGCCGCGA TCAAGCGCGC TTTGGTGCAT
CTGCTCGCCA ACGGCGACGC CGAACAGAAG CCGGCGGCGT GA
 
Protein sequence
MDKVATPPHR PAGDERFDTQ RITAEIAVLA AQHGGNDQAF RNALAQLMKA ELAKARAEAE 
AQLLRDRHGR RCAERLCFVK DEIIRLLFMA ATKYLYNSPT PSSSERMSVV ATGGYGRGLM
APESDIDLLF ILPYKQTAWG EQVAEVVLYS LWDIGLKVGH ATRSVDECIR QARADMTIRT
AILETRFLTG DKALYAELVE RFDKEVVEGT AAEFVTAKLA EREERHRRSG QSRYLVEPNV
KDGKGGLRDL HTLFWIAKYV YRVRETSELL ERGVFDAIEF RTFRRCEDFL WSVRCNLHFL
TRRAEERLSF DLQREIGVRL GYTSHPGMQD VERFMKHYFL IAKEVGNLTA ILCAKLEDQQ
AKPAPALSRM MARLRPGATR RRVPESDDFV IDNNRINLAA PDVFKHDPVN LIRIFRLAQK
NSLAFHPDAM RTVTRSLALI NAQLRDDPEA NRLFIEILTS DNSEPVLRRM NETGVLGRFI
RAFGRIVSMM QFNMYHSYTV DEHLLRCVGN LQEIERGGND EFMLSSDLTR RIRPDHRAVL
YVAVLLHDIA KGQPEDHSTA GAKVARRLCP RFGFNAADTE LIAWLIEKHL VMSTVAQSRD
LSDRKTIENF AAVVESVEQM KLLTILTTAD IRGVGPGVWN GWKAQLLRTL YYETEPVLTG
GFSEVNRAER IRAAQAEFRA AFTEWPEAEL DAYVARHYPA YWLKVELARK LRHARFLRAS
EQAGNKLAVN VGFDEARGVT ELTILAVDHP WLLSIIAGAC ASAGANIVDA QIYTTTDGRA
LDTISIRREY DRDEDEGRRA TRIGEIIEEV LEGKLRLPEA VARRATSSKT KLRAFVVEPE
ISINNNWSDR YTVIEVSGLD RPGLLYQLTT AISKLNLNIA SAHVATFGER ARDVFYVTDL
LGAQITAPTR QAAIKRALVH LLANGDAEQK PAA
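
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any computation. The following is a minimal sketch of how one might clean a sequence block, compute its GC percentage, and translate it with the codon assignments of translation table 11. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sketch does not handle alternative start codons; only the first line of the gene sequence is used as input here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGACAAAG TTGCGACACC TCCTCACCGG CCAGCCGGCG ACGAACGGTT CGATACCCAG"
dna = clean(first_line)
print(len(dna))        # prints: 60
print(translate(dna))  # prints: MDKVATPPHRPAGDERFDTQ
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to the same functions would give the whole-gene values.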