Gene Gdia_0300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0300 
Symbol 
ID6973692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp331184 
End bp334153 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content68% 
IMG OID643389831 
ProductPII uridylyl-transferase 
Protein accessionYP_002274712 
Protein GI209542483 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.940535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.172702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGCC GACCTCCGTC CGGCCCCACG CCCATGCAGG AAATCGATCC CCCCGCGATG 
TCGACCCCGT CTTCCCCCTC CCAGGCTTCC ACCCCCTCGG CCGTCAGGGA CCTGACCACC
AGCCTCGCGG CATCGCTCCT TTCCCCCGAG GACGGCGCGG CCGTACCGCG CGAGCAGGCC
ATCGCGCTGT TCCGCCGCCA TCTCGCCCGG TTCCAGGCAT CGGTGCGCGA GGAATTCGAG
GCCCATCGCC TGCATGGCAC GTCGGCCGCC AAGCAACTGG CACTGCACAC CGACGGCATG
ATCCGCACCC TGGTCGATTT CACACTGGAC CACGCGCTGG CCGGCTCGAT CGGGCCTGGG
GCACGCAGCC TGGCGGTCAC CGCGACGGGG GGATACGGGC GCGGCATGCT GGCGCCGTTC
AGCGATATCG ACCTGCTGTT CCTGACGACC GACGAACCCT CGGCCGACGT CAGCCGCGTG
GTGGAATACA TCCTGTATTT CCTGTGGGAC CTGGGGCTGA AGGTCGGGCA CGCCACGCGC
TCCATCGCGC AATGCATTGC CGAGGCCGAG GCCGACACCA CCGTCCGCAC CACATTGCTG
GACGCCCGGC TGCTGGCCGG CGACGCGTCG CTGTTCGCCA TGTTCGAGGC CCGGTACATC
GTCGCCTGCG TCGAGGCCGG GGCCGCGCGC TTCATCTCGG ACAAGCACAA GGAACGCACG
GCGCGCCATA ACCGCTTCGG CGACAGTCCC TATCTGGTCG AACCGAACGT GAAGGAAGGG
CGCGGCGGCC TGCGGGACCT GCAGACCCTG TACTGGATGT GCCGCTACAC GTTCGGCACG
CGCCATGTAT CCGACCTGCT GGCACCGGGC TTCAGCCGCC TGGGCCTGCT GACCGAGCAG
GAGGCCAAGC GCGCCCGCCG GTCCTGGGAC TTCCTGTGGA GCGTCCGGCT GCACCTGCAT
TACATCTCGG GCCGGGCGGA GGAGCGCCTG ACCTTCGACG TGCAGCCCGT GGTCGGCGCG
CGCATGGGCT ACACCCGCCA TGGGCGCCAG GTGGGCGTCG AGCGCTTCAT GCGCCATTAT
TTCCTGACGG TGCGCGAGGT CATGCGCCTG ACCCACGTGC TGGAACCCGC CGTGATGCGC
CAGGCGCTGG GCCCGGCGGC CAACGCGCCG CAAGCCGACA GCGCGATGCG CGACGCGGGC
TTCACCGTCC TGGACGGCCA GATCCTGCCG GAACGCGGCA CCTCGTTCGA TGCCGAGCCG
ATCCAGATGA TGCGGCTGCT CGAATGGGCG CGCACCCGCA AGCTGCCCAT CCACCCGCTG
GCCATGCACC AGCTGATCCG CTGGGAACGG CGGGCCGCCA GCCTGCGCGG CGACCCCGAG
GCCGCGCGCA TCTTCCTGGA ACTGCTGTGC GGCACCCCGC CGGAGCGCAT CGGCCGCCCG
CCCCATAGCG CCGAGGCCGA GAACGCGGCC GGCGAAGAGG TCCCCAGCTT CCACGCCACC
GCGCAGGACC GCCGCCAAGG CAACGCCTAC TGGCTGCATA TCCTGAACGA AACCGGGATC
ATGGGGCGGC TGATGCCCGA CTGGTCGCGC ATCGTCGGCC AGATGCAGTT CGACACCTAT
CACGTGTTCA CGGTCGACGA GCACACGATC GAGGCCATCC GCATCTTCGG CCGGATCGAA
CATGGCGCCA TGGCCGACGA AATTCCGCAG GCGTACGACC TGGCGCGCAA CCTGCAATCG
CGGCGGGCCC TCTACATGGC CATCCTGCTG CACGACATCG CCAAGGGACG CGGCGGCGAC
CATTCCGAAC TGGGGTCGGA AATCGCGCTG GATGTCTGCC CGGAAATGGG CCTGACCGGC
GAGGAGACCG AAACCGTATC CTGGCTGGTG CTGCATCACC TGCTGCTGAG CCACACGGCC
TTCCAGCGCG ACATCGACGA CCCGAAGACC ATCCTGGACT TGGCCGACAC CATCCAGTCG
CCCGAGCGCC TGCGGCTGCT GCTGCTGCTG ACCATCGTGG ACATGCGCGC CGTCAGCCCG
CGCGTGTGGA ATGCCTGGAA GGCCACCCTG CTGCACGAGC TGTACATGCG CGTGGCCGAG
GTGCTGGAGG GCGGCCTGGC CACCACCGAA CGCGACGTGC GCGTGGCCCG CGCCAAGGAC
GCGGCGGCCG AGATCCTGGA AGATGACGGG TTCAAGCGCG CGGACATCGA TCATTTCCTG
GGCCTGGGCT ATGGCAGCTA CTGGCTGTCC TTCGACCAGG ACACCCACGC CCGCCATGCC
GAGCTGATTC GCGAGGCCGA ACGGCACAAG GCCCCGCTGA CGGTCGAAAC CCAGCCCCTG
CCCGCCCGTG GCGTGACCGA GGTCACGATC TACACCGCCG ACCATCCCGG CCTGTTCTCG
CGCATGGCCG GCGCGCTGGC GATCGCGGGG GCGTCGATCG TCGATGCCCG CATCCACACG
CTGATCAACG GCATGGCGCT GGACACGTTC TGGATTCAGG ACGCGGGCGG CGAGGCGTTC
GAGGAACCGC ACCAGTTGGC CCGCCTGTCG GCGCTGGTCG AACAGGCGCT GTCCGGCCGG
GTGGACATTC CCAAGGAAAT CGTCAGCGCC GGCCGCATGC GCTATGGGCG GCGCATGCGC
GCGATCCACG TGCCACCCCG CGTGGTGATC GACAACCGGG CATCGAACAC CTACACGGTC
ATCGAAATCA ACGGCCGCGA CCGCCCCGGC CTGCTGCATG ACGTGACCCA GGCGATCAGC
GACCACAAAT TGCAGATCGC CTCGGCCCAT ATCACGACCT ACGGCGTACG CGCGGTGGAC
GTGTTCTACG TCAAGGACCT GTTCGGCCTG AAGATCACTG ACGAGCGACG CCTGGGCGAA
ATCCGCGAAG CCCTGCTGCA CGGCCTGCGC CAGGCCGAGG AAGCCATGAC CAGCGAAATC
GGGCCGCCGG CGGAATCGCT GATCGCGTAG
 
Protein sequence
MEGRPPSGPT PMQEIDPPAM STPSSPSQAS TPSAVRDLTT SLAASLLSPE DGAAVPREQA 
IALFRRHLAR FQASVREEFE AHRLHGTSAA KQLALHTDGM IRTLVDFTLD HALAGSIGPG
ARSLAVTATG GYGRGMLAPF SDIDLLFLTT DEPSADVSRV VEYILYFLWD LGLKVGHATR
SIAQCIAEAE ADTTVRTTLL DARLLAGDAS LFAMFEARYI VACVEAGAAR FISDKHKERT
ARHNRFGDSP YLVEPNVKEG RGGLRDLQTL YWMCRYTFGT RHVSDLLAPG FSRLGLLTEQ
EAKRARRSWD FLWSVRLHLH YISGRAEERL TFDVQPVVGA RMGYTRHGRQ VGVERFMRHY
FLTVREVMRL THVLEPAVMR QALGPAANAP QADSAMRDAG FTVLDGQILP ERGTSFDAEP
IQMMRLLEWA RTRKLPIHPL AMHQLIRWER RAASLRGDPE AARIFLELLC GTPPERIGRP
PHSAEAENAA GEEVPSFHAT AQDRRQGNAY WLHILNETGI MGRLMPDWSR IVGQMQFDTY
HVFTVDEHTI EAIRIFGRIE HGAMADEIPQ AYDLARNLQS RRALYMAILL HDIAKGRGGD
HSELGSEIAL DVCPEMGLTG EETETVSWLV LHHLLLSHTA FQRDIDDPKT ILDLADTIQS
PERLRLLLLL TIVDMRAVSP RVWNAWKATL LHELYMRVAE VLEGGLATTE RDVRVARAKD
AAAEILEDDG FKRADIDHFL GLGYGSYWLS FDQDTHARHA ELIREAERHK APLTVETQPL
PARGVTEVTI YTADHPGLFS RMAGALAIAG ASIVDARIHT LINGMALDTF WIQDAGGEAF
EEPHQLARLS ALVEQALSGR VDIPKEIVSA GRMRYGRRMR AIHVPPRVVI DNRASNTYTV
IEINGRDRPG LLHDVTQAIS DHKLQIASAH ITTYGVRAVD VFYVKDLFGL KITDERRLGE
IREALLHGLR QAEEAMTSEI GPPAESLIA