Gene Information |
Locus tag | RPB_0032 |
Symbol | |
ID | 3909715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 30131 |
End bp | 32929 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881913 |
Product | PII uridylyl-transferase |
Protein accession | YP_483655 |
Protein GI | 86747159 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.968292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.957944 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACAAAG TCGTGACACC TCAACGGCCG GCCGCCGACG AGCGGTTCGA TACAACGCGG ATCACCGCCG ACATCGCGGC GCTGGCCGGG CAGCATGCCG GCAACGAGCC GACGTTCCGC ACCGCGCTCG CGCAACTCAT GAAGGCCGAA CTCGCCAAGG CCCGCGCGGC GGCCGAGGCG CAGTTGCTGC GTGACCGCCA CGGCCGGCGC TGTGCCGAGC ATCTGTGCTT CGTCCAGGAC GAGATCATCC GGCTGCTGTT CATGGCGGCG ACTCAATATC TTTACAATTC GCCGACGCCG TCGAGCTCGG AGCGGATGTC GGTGGTCGCC ACCGGCGGCT ACGGCCGCGG CCTGATGGCG CCGGAGAGCG ACATCGATCT GCTGTTCATC CTGCCCTACA AGCAGACCGC CTGGGGCGAG CAGGTCGCCG AGGTGATCCT GTATTGCCTG TGGGACATCG GTCTCAAGGT CGGCCACGCC ACCCGCTCGG TCGACGAGTC GATCCGTCAG GCCCGGGCCG ACATGACGAT CCGCACCGCG ATCCTGGAAA CCCGCTTCCT GACCGGCGAC AAGGCGCTGT ATCAGGAACT GGTCGACCGC TTCGACAAGG AAGTGGTCGA GGGCACCGCC GCCGAATTCG TCACCGCCAA GCTCGCCGAG CGCGAGGAGC GGCATCGCCG CTCCGGCCAG TCGCGCTATC TGGTCGAGCC CAACGTCAAG GACGGCAAGG GCGGCTTGCG CGACCTGCAC ACGCTGTTCT GGATCGCCAA ATACGTCTAC CGCGTTCGCG GCACCAGCGA ACTGCTCGAA CACGGCGTGT TCGACCAGAT CGAATTCCGC ACCTTCCGCC GCTGCGAGGA TTTCCTCTGG TCGGTGCGCT GCAATCTGCA TTTCGTGACG CGGCGCGCCG AGGAGCGGCT GTCGTTCGAC CTGCAGCGCG AGATCGGCGT GCGGCTCGGC TACACCTCGC ATCCCGGGAT GCAGGACGTC GAGCGCTTCA TGAAGCACTA CTTCCTGATC GCCAAGAAGG TCGGCAATCT GACCGCTATC CTGTGCGCCA AGCTCGAGGA CCAGCAGGCC AAGCCGGCGC CGGCGCTGTC GCGGATGATG GCGCGGCTGC GGCCCGGCGC GACGCGACGC CGGGTACCCG AGAGCGACGA CTTCGTCATC GACAACAACC GCATCAATTG CGCGATGCCG GACGTGTTCA AGCACGACCC GGTCAATCTG ATCCGGATCT TCCGGTTGGC GCAGAAGAAC AACCTCGCCT TCCATCCCGA CGCGATGCGG ACGGTGACGC GCTCGCTGGC ACTGATCAAC GCCCAGCTCC GCGACGATCC CGAGGCCAAT CGACTGTTCA TCGAGATCCT GACCTCGGAC AATGCCGAAC AGGTGCTGCG GCGGATGAAC GAGACCGGCG TGCTCGGCCG CTTCATCCGC GCCTTCGGCC GCATCGTCTC GATGATGCAG TTCAACATGT ATCACAGCTA CACGGTGGAC GAGCATCTGC TGCGCTGTGT CGGCAACCTC CAGGAGATCG AGCGCGGCGG CAACGACGAG TTCATCCTGT CGTCCGACCT GATCCGCAAG ATCCGCCCCG ACCACCGTGC GGTGCTGTAT GTCGCGGTGC TGCTGCACGA CATCGCCAAG GGCCAGCCGG AGGACCACTC CACCGCCGGC GCCAAGGTGG CGCGGCGGCT GTGCCCGCGG TTCGGCTTCA ACGCCGCCGA CACCGAACTG ATCGCCTGGC TGATCGACAA GCATCTGGTG ATGTCCACCG TGGCGCAATC ACGCGACCTG 
TCGGACCGCA AGACCATCGA GAATTTCGCC GCCGTGGTCG AGTCGGTCGA GCAGATGAAG CTCTTGACCA TCCTCACCAC CGCCGACATT CGCGGCGTCG GCCCGGGCGT CTGGAACGGC TGGAAGGCGC AGCTGTTGCG CACGCTGTAT TACGAGACCG AGCCGGTGCT GACCGGCGGC TTCTCCGAGG TCAACCGCGC CGAGCGCATC CGCGGCGCGC AGGCGGAATT CCGCGCCAAT TTCACCGAAT GGCCGGCGGC CGAGCTCGAC GCCTACGTGG CGCGGCACTA CCCGGCGTAC TGGCTCAAGG TCGAGCTGGC GCGCAAGATC CGCCACGCCC GCTTCCTGCG CGCCTCCGAA CAGGCCGGCC ACAAGCTCGC GGTCAATGTC GGCTTCGACG AGGCGCGCGG CGTCACCGAA CTGACGATCC TGGCGGTCGA CCATCCCTGG CTGCTGTCGA TCATCGCCGG CGCCTGCGCC TCGGCCGGCG CCAACATCGT CGACGCCCAG ATCTACACCA CCACCGACGG CCGCGCGCTC GACACCATCT CGATCAGCCG CGAATACGAT CGCGACGAGG ACGAGGGCCG CCGCGCGACG CGGATCGGCG AGATGATCGA GGAAGTGCTG GAAGGCAAGC TGCGGCTGCC CGAAGCGGTG GCGCGGCGCG CCACCAACGG CCGCGCCAAG CTGCGCGCCT TCGTGGTCGA GCCGGAAGTC TCGATCAATA ACAATTGGTC GGACCGCTAC ACCGTGATCG AGGTCAGTGG CCTCGACCGC CCCGGCCTGC TGTATCAGCT CACCACCGCG ATCTCGAAGC TGAACCTCAA CATCGCCTCG GCGCATGTCG CCACCTTCGG CGAACGCGCC CGCGACGTGT TCTACGTCAC CGATCTGCTC GGCGCCCAGA TCACCGCGCC GACCCGGCAG GCCGCGATCA AGCGCGCGCT GGTTCACCTG CTCGCCAATG GCGACGCGGA GCAACAGCCG GCGGCGTGA
|
Protein sequence | MDKVVTPQRP AADERFDTTR ITADIAALAG QHAGNEPTFR TALAQLMKAE LAKARAAAEA QLLRDRHGRR CAEHLCFVQD EIIRLLFMAA TQYLYNSPTP SSSERMSVVA TGGYGRGLMA PESDIDLLFI LPYKQTAWGE QVAEVILYCL WDIGLKVGHA TRSVDESIRQ ARADMTIRTA ILETRFLTGD KALYQELVDR FDKEVVEGTA AEFVTAKLAE REERHRRSGQ SRYLVEPNVK DGKGGLRDLH TLFWIAKYVY RVRGTSELLE HGVFDQIEFR TFRRCEDFLW SVRCNLHFVT RRAEERLSFD LQREIGVRLG YTSHPGMQDV ERFMKHYFLI AKKVGNLTAI LCAKLEDQQA KPAPALSRMM ARLRPGATRR RVPESDDFVI DNNRINCAMP DVFKHDPVNL IRIFRLAQKN NLAFHPDAMR TVTRSLALIN AQLRDDPEAN RLFIEILTSD NAEQVLRRMN ETGVLGRFIR AFGRIVSMMQ FNMYHSYTVD EHLLRCVGNL QEIERGGNDE FILSSDLIRK IRPDHRAVLY VAVLLHDIAK GQPEDHSTAG AKVARRLCPR FGFNAADTEL IAWLIDKHLV MSTVAQSRDL SDRKTIENFA AVVESVEQMK LLTILTTADI RGVGPGVWNG WKAQLLRTLY YETEPVLTGG FSEVNRAERI RGAQAEFRAN FTEWPAAELD AYVARHYPAY WLKVELARKI RHARFLRASE QAGHKLAVNV GFDEARGVTE LTILAVDHPW LLSIIAGACA SAGANIVDAQ IYTTTDGRAL DTISISREYD RDEDEGRRAT RIGEMIEEVL EGKLRLPEAV ARRATNGRAK LRAFVVEPEV SINNNWSDRY TVIEVSGLDR PGLLYQLTTA ISKLNLNIAS AHVATFGERA RDVFYVTDLL GAQITAPTRQ AAIKRALVHL LANGDAEQQP AA
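The numeric fields in this record are related by simple arithmetic, and a short sketch can make the consistency checks explicit. The helper names below (`gene_length`, `expected_protein_length`, `clean_sequence`, `gc_percent`) are hypothetical and not part of any database API. The sketch assumes 1-based inclusive coordinates, an unspliced CDS whose final codon is a stop codon, and a sequence grouped into whitespace-separated blocks as shown above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(30131, 32929)      # Start bp and End bp from the record
    print(length)                           # 2799, matching "Gene Length"
    print(expected_protein_length(length))  # 932, matching "Protein Length"
```

To check the GC content field, the full gene sequence from the record can be passed to `gc_percent`; the result should round to the reported percentage if the record is internally consistent.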