Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0300 |
Symbol | |
ID | 6973692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 331184 |
End bp | 334153 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643389831 |
Product | PII uridylyl-transferase |
Protein accession | YP_002274712 |
Protein GI | 209542483 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.940535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.172702 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGCC GACCTCCGTC CGGCCCCACG CCCATGCAGG AAATCGATCC CCCCGCGATG TCGACCCCGT CTTCCCCCTC CCAGGCTTCC ACCCCCTCGG CCGTCAGGGA CCTGACCACC AGCCTCGCGG CATCGCTCCT TTCCCCCGAG GACGGCGCGG CCGTACCGCG CGAGCAGGCC ATCGCGCTGT TCCGCCGCCA TCTCGCCCGG TTCCAGGCAT CGGTGCGCGA GGAATTCGAG GCCCATCGCC TGCATGGCAC GTCGGCCGCC AAGCAACTGG CACTGCACAC CGACGGCATG ATCCGCACCC TGGTCGATTT CACACTGGAC CACGCGCTGG CCGGCTCGAT CGGGCCTGGG GCACGCAGCC TGGCGGTCAC CGCGACGGGG GGATACGGGC GCGGCATGCT GGCGCCGTTC AGCGATATCG ACCTGCTGTT CCTGACGACC GACGAACCCT CGGCCGACGT CAGCCGCGTG GTGGAATACA TCCTGTATTT CCTGTGGGAC CTGGGGCTGA AGGTCGGGCA CGCCACGCGC TCCATCGCGC AATGCATTGC CGAGGCCGAG GCCGACACCA CCGTCCGCAC CACATTGCTG GACGCCCGGC TGCTGGCCGG CGACGCGTCG CTGTTCGCCA TGTTCGAGGC CCGGTACATC GTCGCCTGCG TCGAGGCCGG GGCCGCGCGC TTCATCTCGG ACAAGCACAA GGAACGCACG GCGCGCCATA ACCGCTTCGG CGACAGTCCC TATCTGGTCG AACCGAACGT GAAGGAAGGG CGCGGCGGCC TGCGGGACCT GCAGACCCTG TACTGGATGT GCCGCTACAC GTTCGGCACG CGCCATGTAT CCGACCTGCT GGCACCGGGC TTCAGCCGCC TGGGCCTGCT GACCGAGCAG GAGGCCAAGC GCGCCCGCCG GTCCTGGGAC TTCCTGTGGA GCGTCCGGCT GCACCTGCAT TACATCTCGG GCCGGGCGGA GGAGCGCCTG ACCTTCGACG TGCAGCCCGT GGTCGGCGCG CGCATGGGCT ACACCCGCCA TGGGCGCCAG GTGGGCGTCG AGCGCTTCAT GCGCCATTAT TTCCTGACGG TGCGCGAGGT CATGCGCCTG ACCCACGTGC TGGAACCCGC CGTGATGCGC CAGGCGCTGG GCCCGGCGGC CAACGCGCCG CAAGCCGACA GCGCGATGCG CGACGCGGGC TTCACCGTCC TGGACGGCCA GATCCTGCCG GAACGCGGCA CCTCGTTCGA TGCCGAGCCG ATCCAGATGA TGCGGCTGCT CGAATGGGCG CGCACCCGCA AGCTGCCCAT CCACCCGCTG GCCATGCACC AGCTGATCCG CTGGGAACGG CGGGCCGCCA GCCTGCGCGG CGACCCCGAG GCCGCGCGCA TCTTCCTGGA ACTGCTGTGC GGCACCCCGC CGGAGCGCAT CGGCCGCCCG CCCCATAGCG CCGAGGCCGA GAACGCGGCC GGCGAAGAGG TCCCCAGCTT CCACGCCACC GCGCAGGACC GCCGCCAAGG CAACGCCTAC TGGCTGCATA TCCTGAACGA AACCGGGATC ATGGGGCGGC TGATGCCCGA CTGGTCGCGC ATCGTCGGCC AGATGCAGTT CGACACCTAT CACGTGTTCA CGGTCGACGA GCACACGATC GAGGCCATCC GCATCTTCGG CCGGATCGAA CATGGCGCCA TGGCCGACGA AATTCCGCAG GCGTACGACC TGGCGCGCAA CCTGCAATCG CGGCGGGCCC TCTACATGGC CATCCTGCTG CACGACATCG CCAAGGGACG CGGCGGCGAC CATTCCGAAC TGGGGTCGGA AATCGCGCTG GATGTCTGCC CGGAAATGGG CCTGACCGGC GAGGAGACCG AAACCGTATC CTGGCTGGTG CTGCATCACC TGCTGCTGAG CCACACGGCC TTCCAGCGCG ACATCGACGA CCCGAAGACC ATCCTGGACT TGGCCGACAC CATCCAGTCG CCCGAGCGCC TGCGGCTGCT GCTGCTGCTG ACCATCGTGG ACATGCGCGC CGTCAGCCCG CGCGTGTGGA ATGCCTGGAA GGCCACCCTG CTGCACGAGC TGTACATGCG CGTGGCCGAG GTGCTGGAGG GCGGCCTGGC CACCACCGAA CGCGACGTGC GCGTGGCCCG CGCCAAGGAC GCGGCGGCCG AGATCCTGGA AGATGACGGG TTCAAGCGCG CGGACATCGA TCATTTCCTG GGCCTGGGCT ATGGCAGCTA CTGGCTGTCC TTCGACCAGG ACACCCACGC CCGCCATGCC GAGCTGATTC GCGAGGCCGA ACGGCACAAG GCCCCGCTGA CGGTCGAAAC CCAGCCCCTG CCCGCCCGTG GCGTGACCGA GGTCACGATC TACACCGCCG ACCATCCCGG CCTGTTCTCG CGCATGGCCG GCGCGCTGGC GATCGCGGGG GCGTCGATCG TCGATGCCCG CATCCACACG CTGATCAACG GCATGGCGCT GGACACGTTC TGGATTCAGG ACGCGGGCGG CGAGGCGTTC GAGGAACCGC ACCAGTTGGC CCGCCTGTCG GCGCTGGTCG AACAGGCGCT GTCCGGCCGG GTGGACATTC CCAAGGAAAT CGTCAGCGCC GGCCGCATGC GCTATGGGCG GCGCATGCGC GCGATCCACG TGCCACCCCG CGTGGTGATC GACAACCGGG CATCGAACAC CTACACGGTC ATCGAAATCA ACGGCCGCGA CCGCCCCGGC CTGCTGCATG ACGTGACCCA GGCGATCAGC GACCACAAAT TGCAGATCGC CTCGGCCCAT ATCACGACCT ACGGCGTACG CGCGGTGGAC GTGTTCTACG TCAAGGACCT GTTCGGCCTG AAGATCACTG ACGAGCGACG CCTGGGCGAA ATCCGCGAAG CCCTGCTGCA CGGCCTGCGC CAGGCCGAGG AAGCCATGAC CAGCGAAATC GGGCCGCCGG CGGAATCGCT GATCGCGTAG
|
Protein sequence | MEGRPPSGPT PMQEIDPPAM STPSSPSQAS TPSAVRDLTT SLAASLLSPE DGAAVPREQA IALFRRHLAR FQASVREEFE AHRLHGTSAA KQLALHTDGM IRTLVDFTLD HALAGSIGPG ARSLAVTATG GYGRGMLAPF SDIDLLFLTT DEPSADVSRV VEYILYFLWD LGLKVGHATR SIAQCIAEAE ADTTVRTTLL DARLLAGDAS LFAMFEARYI VACVEAGAAR FISDKHKERT ARHNRFGDSP YLVEPNVKEG RGGLRDLQTL YWMCRYTFGT RHVSDLLAPG FSRLGLLTEQ EAKRARRSWD FLWSVRLHLH YISGRAEERL TFDVQPVVGA RMGYTRHGRQ VGVERFMRHY FLTVREVMRL THVLEPAVMR QALGPAANAP QADSAMRDAG FTVLDGQILP ERGTSFDAEP IQMMRLLEWA RTRKLPIHPL AMHQLIRWER RAASLRGDPE AARIFLELLC GTPPERIGRP PHSAEAENAA GEEVPSFHAT AQDRRQGNAY WLHILNETGI MGRLMPDWSR IVGQMQFDTY HVFTVDEHTI EAIRIFGRIE HGAMADEIPQ AYDLARNLQS RRALYMAILL HDIAKGRGGD HSELGSEIAL DVCPEMGLTG EETETVSWLV LHHLLLSHTA FQRDIDDPKT ILDLADTIQS PERLRLLLLL TIVDMRAVSP RVWNAWKATL LHELYMRVAE VLEGGLATTE RDVRVARAKD AAAEILEDDG FKRADIDHFL GLGYGSYWLS FDQDTHARHA ELIREAERHK APLTVETQPL PARGVTEVTI YTADHPGLFS RMAGALAIAG ASIVDARIHT LINGMALDTF WIQDAGGEAF EEPHQLARLS ALVEQALSGR VDIPKEIVSA GRMRYGRRMR AIHVPPRVVI DNRASNTYTV IEINGRDRPG LLHDVTQAIS DHKLQIASAH ITTYGVRAVD VFYVKDLFGL KITDERRLGE IREALLHGLR QAEEAMTSEI GPPAESLIA
|
| |