Gene Gdia_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1331 
Symbol 
ID6974739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1485435 
End bp1488584 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content69% 
IMG OID643390863 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_002275728 
Protein GI209543499 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.235182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCAAGTT ATGCCGAACT GCAGGCCGTC ACGAATTATT CCTTCCTGCG CGGCGCCAGC 
CACCCGGATG AACTGTTCGC CCGGGCGCGG GCACTGGGCC ATGCGGCGCT GGGGATCGTC
GATCATGACG GCGTCGCCGG CATCGTGCAG TCCTGGAAGG CGGCGGAAAA ACACGGGGTG
CGGCTGGTGG TCGGGTGCCG GCTGGTCCTG CGGGACGGGC CGCCTTTGCT GGCCTATCCG
ACCGACCGTG CTGCCTGGGG GCGCCTGTGC CGGCTGCTGA CCGCATCGAA GCGCAAGGCC
GCTGACGGCA CGGCCCCGGG GCTGCGCCGC GCGGATCTGC ACAGGGCGAC CGACGGGCTG
ATCCTGATCC TGCTGCCGGA CGATCCGGAC GCGGCCCTGG CGCGGGACCT GGACTGGCTG
CGCGCGCTGT GCGGCGACCG GGGCTATCTG GCGCTGACCC TGCGCCGCCG GCCAGGCGAT
GCCGTGCGGC TGCATCGCCT GGCAGAGATG GCGCGAGCGG CCGGCGTGGC CGATGTGGTG
ACGGGCGACG TGCTGTATCA CGAACCGGCC CGGGCGATGC TGCAGGACGT CCTGACCTGC
ATCCGCACCG GCTGCACGAT CGACGAACTG GGGCAGGGGC GCGAGGCTTA TGCCGATCGG
CACCTGAAAT CCCCGGCCGA GATGGCGGTG CTGTATGCAG CACATCCCGC CGCCCTGGCG
CGGACGCGGG AGATCGCGGC ACGCTGCACC TTCTCGCCGG CCGACCTGCG CTACCAGTAT
CCCGAGGAAA GCGACGATCC GTCCGAGACC CCGCAGGACC GCCTGGAACG GCTGGCCCGC
GCCGCCCTGC GCGACCGCTA TCCCGCCGGG GCGCCTGATC CGGTGCGGCG GCAGGTGGAG
TACGAACTCG GCCTGATCCG CGACCTGCAT TATGCGCCCT ATTTCCTGAC GGTGCATACC
ATCGTCCGTC AGGCGCGGTC GCAGGGTATC CTGTGCCAGG GGCGGGGGTC GGCGGCGAAT
TCCGCCGTCT GTTTCGTGCT GGGCATCACG GAGATCGATC CGGCCTCGTC GAACCTGCTG
TTCGAACGCT TCGTATCGGC CGAACGCCAG GAACCGCCCG ATATCGACGT CGATTTCGAA
AGCGAGCGGC GCGAGGAGGT CATTCAGTGG ATCTATCGTC GCTATGGCCG GGACCGCGCC
GCCCTGTGCG CCACCACCAT GCGCTATCGC GCGCGCGGGG CGTTGCGCGA CGTGGGCAAG
GTGCTGGGCC TGCCGGCCGA CGTCACGGGC CTGCTGTCCA CCCATCTGGG CGCCCTGTCG
TTCGACGAGG ACGCGTTTCA TGAACGGGCG CGGGAACTGG GGTTGAACCT GCGCAACCGG
CGTCTGCTGC TGACGCTGCA ACTGGCGGGG GAACTGATCG GCTTTCCCCG GCAGTTGGGC
ACCCATCCGG GCGGCTTCGT GCTGACCCGC GACCGGCTGG ACGACCTGGT GCCGATCCAG
CCGGCGGCGA TGGACGACCG GCAGATCATC GTCTGGGACA AGGACGATAT CGACACGCTG
CGCTTCATGA AGGTCGACGT GCTGGGGCTG GGCATGCTGG GCTGCATGCG CCGCGCGTTC
GACATGCTGG AAGACCATTA CGGAAAACGG CTGACCCTGT CGGGCATCCC GCCAGGTGAC
GCGGACACCT ACGCCATGAT CAGCCGCGCC GACACGATCG GCACGTTCCA GATCGAAAGC
CGCGCGCAGA TGGCGATGCT GCCGCGCATG AAGCCCCGGG AATTCTATGA TCTGGTGATC
CAGGTCGCGA TCGTCCGCCC CGGCCCGATC CAGGGCGACA TGGTGCATCC CTACCTGCAA
CGGCGCGCCG ACCGGTCGCT GGTGGACTAT CCGTCGCCGG AACTGAGGGA TATCCTGCAC
AAGACGCTGG GCGTCCCGCT TTTTCAGGAA CAGGCGATGC AGATCGCCAT CCGCTGCGCC
GGCTTCACGC CGGGCGAGGC CGATGCGCTG CGCCGCGCCA TGGGCACGTT TCGCGGCCAT
GGCACCGTCA CCTATTTCCG CGACCGTCTG ATCGACGGCA TGATGGCACG GCATTATCCG
CGCGATTTCG CCGAACGGAT CTTTCGCCAG CTTGAAGGAT TCGGATCGTA CGGCTTTCCC
GAAAGCCACG CGGCGTCCTT CGCGCTGATC GCCTATGCCT CGTCCTGGAT GAAATGCCAT
TACCCGGACG TGTTCTGCGC CGCCCTGCTG AACAGCCAGC CAATGGGATT CTACCGCCCG
GCCCAGATCG TGCGCGATGC GCGCAATCAC GGCGTGACGG TCCACCCGGT CTGCGTGAAT
GCGTCGCGCT GGGACTGCAC GCTGGAACGC GCACCAGGGC GAAGTACGGC CGTCCGGCTG
GGGCTGCGGA TGGTCAAGGG CCTGTCCAAC ACGGATGCGG CCCGCCTGGT GGCCAGCCGC
ATGCCCCCCT ATGACGGGAT CGAGGATGTC TGGCGCCGGT CCGGCCTGGG CCCCGACGCG
CTGGAATGCC TGGCCCAGGC CGACGCCTTC CACGCGCTGG GGCGGGACCG CCGGGCGGCG
CTGTGGGATA TCAAGGGACT GGCGGAGTCT CCGCTGCCGC TGTTCGCGGC CGCCGACCGG
GGGCTCAACC GGCCATTGCC CGAATGCGTC GAACCGTCCG TCCCGCTGAC GGCGATGACC
GAGGGGCAGG AGGTGGTCGA GGATTACCAC GCCACCGGCC TGACCCTGCG CCGCCATCCG
GTGGCGTTCC TGCGGGACGG GCTGCGCGAT CGCGGCATGA TCGCGTGCGC CGACCTGCGC
GCCCTGCGTG ACGGGCGGCG GGTGGTGGTG CCCGGACTGG TCCTGATGCG GCAGCGGCCG
GGCACGGCGC AGGGCGTGCT GTTCATGACG ATCGAGGATG AAACCGGCAT GGCCAACCTG
GTCCTGTGGA AGGACCGGAT TGCCGCCCAG CGGCGGATCG TCCTGTCGGC CAGCCTGCTG
GCCTGCCACG GCCGCCTGCA GCGCGAGGGC GAGGTCATCC ACGTCATCAC CGAGCACCTG
GAAGACCTGA CCCCCCTGCT GTCGGAAATC GGCCGCGCGG ACCTGCGGGC GGGAATGGTA
CCGGTTATTC CCGCCCGGGA TTTCCGTTAG
 
Protein sequence
MPSYAELQAV TNYSFLRGAS HPDELFARAR ALGHAALGIV DHDGVAGIVQ SWKAAEKHGV 
RLVVGCRLVL RDGPPLLAYP TDRAAWGRLC RLLTASKRKA ADGTAPGLRR ADLHRATDGL
ILILLPDDPD AALARDLDWL RALCGDRGYL ALTLRRRPGD AVRLHRLAEM ARAAGVADVV
TGDVLYHEPA RAMLQDVLTC IRTGCTIDEL GQGREAYADR HLKSPAEMAV LYAAHPAALA
RTREIAARCT FSPADLRYQY PEESDDPSET PQDRLERLAR AALRDRYPAG APDPVRRQVE
YELGLIRDLH YAPYFLTVHT IVRQARSQGI LCQGRGSAAN SAVCFVLGIT EIDPASSNLL
FERFVSAERQ EPPDIDVDFE SERREEVIQW IYRRYGRDRA ALCATTMRYR ARGALRDVGK
VLGLPADVTG LLSTHLGALS FDEDAFHERA RELGLNLRNR RLLLTLQLAG ELIGFPRQLG
THPGGFVLTR DRLDDLVPIQ PAAMDDRQII VWDKDDIDTL RFMKVDVLGL GMLGCMRRAF
DMLEDHYGKR LTLSGIPPGD ADTYAMISRA DTIGTFQIES RAQMAMLPRM KPREFYDLVI
QVAIVRPGPI QGDMVHPYLQ RRADRSLVDY PSPELRDILH KTLGVPLFQE QAMQIAIRCA
GFTPGEADAL RRAMGTFRGH GTVTYFRDRL IDGMMARHYP RDFAERIFRQ LEGFGSYGFP
ESHAASFALI AYASSWMKCH YPDVFCAALL NSQPMGFYRP AQIVRDARNH GVTVHPVCVN
ASRWDCTLER APGRSTAVRL GLRMVKGLSN TDAARLVASR MPPYDGIEDV WRRSGLGPDA
LECLAQADAF HALGRDRRAA LWDIKGLAES PLPLFAAADR GLNRPLPECV EPSVPLTAMT
EGQEVVEDYH ATGLTLRRHP VAFLRDGLRD RGMIACADLR ALRDGRRVVV PGLVLMRQRP
GTAQGVLFMT IEDETGMANL VLWKDRIAAQ RRIVLSASLL ACHGRLQREG EVIHVITEHL
EDLTPLLSEI GRADLRAGMV PVIPARDFR