Gene Information |
Locus tag | Avi_0299 |
Symbol | polA |
ID | 7387592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 252729 |
End bp | 255707 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643649982 |
Product | DNA polymerase I |
Protein accession | YP_002548197 |
Protein GI | 222147240 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
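The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(252729, 255707)
print(length_bp)                  # 2979
print(protein_length(length_bp))  # 992
```

Both values agree with the Gene Length and Protein Length rows of this record.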
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG GCGATCATCT CTTTCTTGTC GACGGCTCCG GCTTCATCTT CCGGGCCTTC CATGCGCTTC CGGCGCTGAC CCGCAAATCG GACGGGCTGC CGGTCGGCGC GGTCTCGGGC TTTTGCAATA TGATGTGGAA GCTGCTGACC GAAGCACGCG ACACGTCAGT CGGCGTCACA CCCACCCACC TTGCTGTGAT TTTCGACTAT TCGTCAAAAA CCTTCCGCAA GGATCTCTAT CCCGAATACA AGGCCAATCG CTCCGCCCCG CCGGAAGATC TCGTGCCACA ATTCGGACTG ATCCGCCAGG CGACGCGGGC CTTCAACCTG CCCTGCATCG AAACCGAAGG CTTCGAGGCC GATGACATTA TCGCCACCTA TGCGCGGGCC GCCGAAGCCA TCGGAGCGGA TGTCACAATC GTCTCTTCCG ACAAGGACCT GATGCAGTTG GTGACTGCCA ATGTCCATAT GTATGACAGC ATGAAGGACA AGCAGATCGG CATTCCCGAT GTCATCGAGA AATGGGGTGT GCCGCCGGAA AAGATGATCG ACCTGCAATC GCTGACCGGC GACAGCACCG ATAATATTCC CGGCATTCCC GGCATCGGCC CGAAGACGGC GGCGCAATTG CTGGAGGAAT ATGGCGATCT GGACACGCTG ATGGCTCGGG CCTCCGAAAT CAAGCAGAAC AAGCGGCGCG AAAACATTCT GGCAGGTGCC GAACTGGTCA AGCTGTCGCG GCAATTGGTA ACGCTGCGCA CCGACGTGCC GCTGGACATG CCGCTGGATG CGCTGATGCT GGAAAAGCAG GATGGACCAA AGCTGGTCGC CTTCCTGAAG GCCATGGAAT TCACCTCGCT GACACGGCGC GTCGCCGACA ATTGCGATTG TGATGCTGGC GCCATCGAGC CCGCCACGAT CTCGGTGGAA TGGGGTGCCT CGGCCCGTGG CCCGGATCTC GATGCCGGTG CCGCAGCTCA AACGCCCGCC TCCGGCCAAG AAGCCGGACC CACGGCACAA TCCGCCAAGG CCGAGGGCGC GACGCCTGCC GATCTTGCCG CGGCCCGGCA ATCGGCCTTT GGTGGCCAGC CCATCGACCG ATCCGCCTAC GTGACCATTC GCGATCTTCC CACCCTGGAA GGCTGGATTG CCGCGGCGCG GGAGGCCGGT TTTGTCGCCT TCGATACGGA AACCACCTCG CTTGACCCGA TGCAGGCTGA TCTGGTCGGT GTGTCGCTGG CCCTTCAGGA TAATGCGGCC TCGCCGGGCG CAGCAACCAT TCGCGCCGCC TATGTGCCGC TTGGCCACAA GACCGGGCGA GATGACCTGT TCAGCGACGG CCTGAAACTG GCGGAAAACC AGATCCCGAT GGATGCGGCG CTTACCGCCT TGAAGGGCCT ACTGGAAGAT GCTTCGGTTC TAAAAGTGGC GCAGAACCTG AAATACGATT ACCTTGTCAT GAAGCGGCAC GGCATTGTTA TCAGGGGCTT CGACGATACG ATGCTGCTGT CCTATGTGCT GGAAGCTGGC GTCGGCGCGC ATGGCATGGA CAGCCTGTCG GAGCGGTGGC TCGGACATAC GCCCATCCCC TATAAGGAGG TCGCTGGCTC CGGCAAATCG CTTGTCACCT TCGATCTCGT CGATATCGAC AAGGCCACCG CCTACGCCGC AGAAGATGCG GACGTCACGC TGCGTCTCTG GCTGGTGCTG AAACCACGGC TTGCCGCCGT CGGTCTGGCC CGCGTCTATG AGCGGCTGGA ACGGCCCCTG GTGCCGGTTC TGGCTGATAT GGAAGAGCGC 
GGCATTACCA TCGACCGACA GATCCTCTCG CGGCTGTCTG GCGAGTTGGC ACAGAAGGCC GCCGCCTTTG AGGATGAAAT TTACGAATTG GCTGGCGAGC GCTTCAATGT CGGCTCGCCC AAACAGCTGG GCGATATCCT GTTTGGCAGA ATGAACCTGC CCGGCGGCTC AAAAACCAAG ACCGGCCAAT GGTCGACGTC CGCGCAGGTG CTGGAAGATC TGGCCGCCCA GGGTGAGCCC CTGCCCCGCA AGATCGTCGA TTGGCGCCAG CTGACCAAGC TGAAATCCAC CTATACCGAC GCCCTGCCCG GCTATGTCCA CCCGCAGACC AAGCGGGTCC ACACATCCTA TTCGATGGCG GCCACCACCA CCGGTCGCCT GTCGTCATCC GAGCCGAACC TGCAAAATAT TCCGGTGCGC ACGGCAGAAG GCCGCAAGAT CCGCACCGCC TTCATTTCCA CCCCAGGCCA TAAGCTACTG TCTGCCGACT ATAGCCAGAT CGAGCTCAGG GTACTGGCGC ATGTGGCGGA TATTCCGCAA TTGCGTCAGG CCTTTGCCGA TGGGGTCGAT ATTCATGCGA TGACGGCCTC TGAAATGTTC GGCGTGCCTG TGGATGGCAT GCCGTCTGAA GTCCGCCGCC GCGCCAAGGC GATCAATTTC GGAATCATCT ACGGTATTTC CGCCTTCGGT CTTGCCAACC AGCTCAGCAT TGAGCGGGCG GAAGCGGGCG AATACATCAA GAAATATTTC GAGCGCTTCC CCGGCATCAA GGACTATATG GAAAGCACCA AGGCTTTCGT GCGTGAGCAT GGCTATGTGG AAACCATTTT CGGACGGCGC GCCCATTACC CGGAGATCAA ATCCTCAAAC CCGTCAATGC GGGCCTTCAA CGAGCGGGCA GCCATCAACG CGCCGATCCA GGGTTCCGCC GCCGATGTCA TCCGCCGCGC CATGGTGCAG GTGGAACCGG CGCTGGCCAA GGCCGGGCTT GGCGAGAAAA CCCGGATGTT GCTCCAGGTG CATGACGAAC TGATCTTCGA AGTCGAGGAT GAGGCCATCG AGGCTGCCTT GCCGGTGATC GTCTCCACCA TGGAAAATGC CGCCATGCCC GCCATCGCCA TGCGCGTCCC ACTCAAAGTC GATGCCCGCG CCGCCGATAA TTGGGACGAA GCGCATTGA
|
Protein sequence | MKKGDHLFLV DGSGFIFRAF HALPALTRKS DGLPVGAVSG FCNMMWKLLT EARDTSVGVT PTHLAVIFDY SSKTFRKDLY PEYKANRSAP PEDLVPQFGL IRQATRAFNL PCIETEGFEA DDIIATYARA AEAIGADVTI VSSDKDLMQL VTANVHMYDS MKDKQIGIPD VIEKWGVPPE KMIDLQSLTG DSTDNIPGIP GIGPKTAAQL LEEYGDLDTL MARASEIKQN KRRENILAGA ELVKLSRQLV TLRTDVPLDM PLDALMLEKQ DGPKLVAFLK AMEFTSLTRR VADNCDCDAG AIEPATISVE WGASARGPDL DAGAAAQTPA SGQEAGPTAQ SAKAEGATPA DLAAARQSAF GGQPIDRSAY VTIRDLPTLE GWIAAAREAG FVAFDTETTS LDPMQADLVG VSLALQDNAA SPGAATIRAA YVPLGHKTGR DDLFSDGLKL AENQIPMDAA LTALKGLLED ASVLKVAQNL KYDYLVMKRH GIVIRGFDDT MLLSYVLEAG VGAHGMDSLS ERWLGHTPIP YKEVAGSGKS LVTFDLVDID KATAYAAEDA DVTLRLWLVL KPRLAAVGLA RVYERLERPL VPVLADMEER GITIDRQILS RLSGELAQKA AAFEDEIYEL AGERFNVGSP KQLGDILFGR MNLPGGSKTK TGQWSTSAQV LEDLAAQGEP LPRKIVDWRQ LTKLKSTYTD ALPGYVHPQT KRVHTSYSMA ATTTGRLSSS EPNLQNIPVR TAEGRKIRTA FISTPGHKLL SADYSQIELR VLAHVADIPQ LRQAFADGVD IHAMTASEMF GVPVDGMPSE VRRRAKAINF GIIYGISAFG LANQLSIERA EAGEYIKKYF ERFPGIKDYM ESTKAFVREH GYVETIFGRR AHYPEIKSSN PSMRAFNERA AINAPIQGSA ADVIRRAMVQ VEPALAKAGL GEKTRMLLQV HDELIFEVED EAIEAALPVI VSTMENAAMP AIAMRVPLKV DARAADNWDE AH
|
| |
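The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how the two relate: strip the spacing, then translate codon by codon until the first stop. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG although the gene lies on the minus strand), and it uses the standard amino-acid assignments, which translation table 11 shares. The alternative start codons of table 11 are not handled, and the helper names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code assignments,
# which translation table 11 also uses); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAAAAG GCGATCATCT CTTTCTTGTC"))  # MKKGDHLFLV
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCGCATTGA"))  # AH
```

The two outputs match the first block and the final two residues of the protein sequence listed above.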