Gene Rpal_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0594 
Symbol 
ID6408244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp648096 
End bp650897 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content66% 
IMG OID642710507 
ProductPII uridylyl-transferase 
Protein accessionYP_001989629 
Protein GI192289024 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAAG TCGTGTCACC TCCCCGCCCC GCCCCCGACG ATCGCTTCGA CAGCGCCCGC 
GTCGCCGCCG AAATCGCAAC GCTGGCCGAA AAACACACCG GCAACGACGC GGCGTTCCGC
ACCGCGCTGG CGATGCTGAT GAAGGCGGAG CTGGCCAAAG CCCGCACCGA GGCCGAAGCG
CAGTTGCTGC GCGACCGCCA CGGCCGGCGC TGCGCCGAGC GGCTGTGCTA CGTCCAGGAC
GCGATCATCC GGCTGCTGTT CAACGCCGCA ACTGAATACC TCTACAACAC CCCGACGCCC
TCGAGCTCGG AACGGATGAC GGTGGTGGCG ACCGGCGGCT ACGGCCGCGG CCTGATGGCG
CCGGAGAGCG ACATCGACCT GCTGTTCATC CTGCCCTACA AGCAGACCGC CTGGGGCGAG
CAGGTCGCCG AAGTCATCCT GTACTGCCTG TGGGACATCG GGCTGAAAGT CGGCCATGCC
ACCCGCTCGG TCGACGAGTG CATCCGCCAG GCGCGCGCCG ACATGACGAT CCGCACCGCG
ATCCTGGAGA CGCGGTTTCT GGCCGGCGAC GAGGCGCTGT ACGCCGAGCT GGTGGAGCGT
TTCGACAAGG AGGTGGTCGA GGGCACCGCG GCCGAATTCG TCGCCGCCAA GCTCGCCGAG
CGCGAGGAGC GCCACCGCCG CTCCGGCCAG TCACGCTATC TGGTCGAGCC CAACGTCAAG
GACGGCAAGG GTGGCCTGCG CGACCTGCAC ACGCTGTTCT GGATCGCCAA ATACGTCTAC
CGGGTTCGCG AAGCCAGCGA GCTGTCCGAG CGCGGCGTGT TCGATCCGGC CGAATTCCGC
ACGTTCCGCC GCTGCGAAGA CTTCCTGTGG TCGGTCCGCT GCAACATTCA CTTCGTGACC
AAGCGCGCCG AAGACCGGCT GTCGTTCGAT CTGCAGCGCG AGATCGGCGT GCGGCTCGGC
TACACCTCGC ATCCGGGAAT GCAGGACGTC GAGCGCTTCA TGAAGCACTA CTTCCTGATC
GCCAAGGAAG TCGGCAACCT CACCGCGATC CTGTGCGCCA AGCTGGAGGA TCAGCAGGCC
AAGGCGGCGC CGGCGCTGAC CCGAATGATG GCGCGGCTGC GGCCCGCCGC CAAGCGCCGC
CGGGTGCCGG AGAGCGACGA TTTCGTCATC GACAATAACC GCATCAATCT GGCAGTGCCG
GACGTGTTCA AGCACGATCC GGTCAATCTG ATCCGGATAT TCCGGCTGGC GCAGAAGAAC
AATCTCGCCT TCCACCCGGA CGCGATGCGC AGCGTGACGC GGTCGCTGTC GCTGATTACG
CCGCAGCTTC GCGACAATCC CGAAGCCAAC CGGCTGTTCG TCGAGATCCT GACTTCCGAC
AACGCCGAGC CGGTGCTGCG GCGGATGAAC GAGACCGGCG TGCTCGGCCG CTTCATCCGC
GCCTTCGGTC GCATCGTGTC GATGATGCAG TTCAACATGT ATCACAGCTA CACGGTGGAC
GAGCATTTGA TCCGCTGCGT CGGCAATCTG CAGGAGATCG AACGCGGCGG GAACGATGAG
TTCGCGCTGT CGTCGGAGCT GATCCGGAAG ATCAGGCCCG ATCACCGTGC GGTGCTGTAT
GCCGCGGTGC TGCTGCACGA TATCGCCAAG GGCCAGCCCG AGGATCACTC CACCGCCGGC
GCCAAGGTGG CGCGGCGGCT GTGCCCGCGC TTCGGCTTCA GCACCGCCGA CACCGAGCTG
GTGGCGTGGC TGATCGAGAA GCATCTGGTG ATGTCCACGG TGGCGCAGTC GCGCGACCTG
TCGGACCGCA AGACCATCGA GAATTTCGCC GCGGTGGTCG AGACCGTCGA GCAGATGAAG
ATGCTCACCA TCCTGACCAC CGCCGACATC CGCGGCGTCG GTCCGGGGGT GTGGAACGGC
TGGAAGGCGC AGTTGATCCG GACCTTGTAC TACGAGACCG AGCCGGTGCT GACCGGCGGC
TTCTCGGAAG TGAACCGCGC CGAGCGCATC CGCGCCGCGC AAGCCGAATT CCGCGCCGCC
TTCACCGAAT GGCCGGAGGC CGACCTCAAT GCCTATGTGG CGCGGCACTA TCCGGCGTAC
TGGCTGAAGG TCGATCTGCA GCGCAAGATC CGCCATGCCC GCTTCCTGCG CGCCTCCGAA
CAGGCCGGCC ATAAGCTGGC GATCAATGTC GGCTTCGACG AAGCCCGCGC CGTCACCGAA
CTCACCATCC TGGCGGTCGA CCATCCGTGG TTGCTGTCGG TGATCGCGGG CGCTTGTGCC
AGCGCCGGCG CCAACATCGT CGACGCCCAG ATCTACACCA CGACCGACGG CCGCGCGCTC
GACACTATCT CGATCAGCCG CGAATACGAC CGCGACGAGG ACGAGGGGCG CCGCGCCACC
CGCATCGGCG AGACCATCGA AGAGGTACTC GAAGGCAAGC TGCGGCTACC CGAGGCGGTC
GCACGCCGCG CGAGCAGCGG CAGCAAGGCC AAGCTGCGCG CCTTCGTGGT CGAGCCCGAG
GTCGAGATCA ACAACAACTG GTCCGACCGC TATACGGTGA TCGAAGTCAG CGGCCTCGAT
CGCCCGGGCC TGCTGTACCA GCTCACCACG GCGATCTCGA AACTCAACCT CAACATCGCC
TCGGCGCATG TCGCGACCTT CGGCGAACGT GCCCGCGACG TGTTCTACGT GACCGATTTG
TTAGGTGCGC AGATCACCGC TCCGACCCGC CAGGCGGCGA TCAAGCGGGC GCTGGTGCAC
CTGCTGGCCA ACGGCGACGC CGCCGAAAAG CCGGCGGCTT GA
 
Protein sequence
MDKVVSPPRP APDDRFDSAR VAAEIATLAE KHTGNDAAFR TALAMLMKAE LAKARTEAEA 
QLLRDRHGRR CAERLCYVQD AIIRLLFNAA TEYLYNTPTP SSSERMTVVA TGGYGRGLMA
PESDIDLLFI LPYKQTAWGE QVAEVILYCL WDIGLKVGHA TRSVDECIRQ ARADMTIRTA
ILETRFLAGD EALYAELVER FDKEVVEGTA AEFVAAKLAE REERHRRSGQ SRYLVEPNVK
DGKGGLRDLH TLFWIAKYVY RVREASELSE RGVFDPAEFR TFRRCEDFLW SVRCNIHFVT
KRAEDRLSFD LQREIGVRLG YTSHPGMQDV ERFMKHYFLI AKEVGNLTAI LCAKLEDQQA
KAAPALTRMM ARLRPAAKRR RVPESDDFVI DNNRINLAVP DVFKHDPVNL IRIFRLAQKN
NLAFHPDAMR SVTRSLSLIT PQLRDNPEAN RLFVEILTSD NAEPVLRRMN ETGVLGRFIR
AFGRIVSMMQ FNMYHSYTVD EHLIRCVGNL QEIERGGNDE FALSSELIRK IRPDHRAVLY
AAVLLHDIAK GQPEDHSTAG AKVARRLCPR FGFSTADTEL VAWLIEKHLV MSTVAQSRDL
SDRKTIENFA AVVETVEQMK MLTILTTADI RGVGPGVWNG WKAQLIRTLY YETEPVLTGG
FSEVNRAERI RAAQAEFRAA FTEWPEADLN AYVARHYPAY WLKVDLQRKI RHARFLRASE
QAGHKLAINV GFDEARAVTE LTILAVDHPW LLSVIAGACA SAGANIVDAQ IYTTTDGRAL
DTISISREYD RDEDEGRRAT RIGETIEEVL EGKLRLPEAV ARRASSGSKA KLRAFVVEPE
VEINNNWSDR YTVIEVSGLD RPGLLYQLTT AISKLNLNIA SAHVATFGER ARDVFYVTDL
LGAQITAPTR QAAIKRALVH LLANGDAAEK PAA