Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_00630 |
Symbol | polA |
ID | 7759030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 65145 |
End bp | 67871 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643802989 |
Product | DNA polymerase I |
Protein accession | YP_002797305 |
Protein GI | 226942232 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGA CCCCCCTCGT CCTGGTGGAC GGCTCCTCCT ATCTGTACCG CGCCTTCCAT GCGCTGCCGC CGCTGGCCAC CTCCGGCGGC ACGCCCACGG GGGCGGTGAA GGGTGTGCTG AACATGCTGC TGGCCCTGCG CAAGCAATAT CCGGACAGCC CCTTCGCGGT GGTGTTCGAT GCCAAGGGGC CGACCTTCCG CGACGAGTTG TTCGAGCACT ACAAGTCGCA CCGCCCGCCG ATGCCGGACG ATCTGCGGGT GCAGATCGAG CCGCTGCACG CCTGCGTGCG CGCCCTCGGC CTGCCGCTGT TGTGCGTCGA GGGGGTGGAG GCGGACGATG TGATCGGCAC CCTGGCGCGC GGCAGCGCGC TCGACGGCCG GCCGGTGGTG ATCTCCACCG GCGACAAGGA CATGGCGCAA CTGGTCGACG GCCACATCAC CCTGGTCAAC ACCATGACCG GCAGCGTGCT GGACCACGCC GGAGTGGCGG AGAAGTTCGG CGTCGGCCCC GAGCTGATCA TCGACTACCT GGCGCTGATG GGCGACAAGG TGGACAACAT CCCCGGCGTG CCGGGAGTCG GCGAGAAGAC CGCGCTCGGC CTGCTGCAGG GCCTGGGCGG CGGCCTCGAT ACCATCTACG CCAACCTCGA GCGGGTGGCG ACGCTGCCGA TCCGTGGCGC CAAGTCGCTG GCCGCCAAGC TCGAGGAGCA CCGCGAGATG GCCTACCTGT CCCACCGGCT GGCGACCATC AAGACCGACG TGGCGCTGGA TATCGAGATC GACGCGCTGC ATCCGGCCGC GCCGGACCGC GAGGCGCTGA TCGCCCTCTA CCGCGAACTG GAATTCAAGA GCTGGCTGGA CGAGCTGCTG CGCGAAGCCC ATGCCGCCGG CCCCACGGCG GACCCGGCGG CGCCCGGCAC GCACTACGAG ACGCTGCTCG ACCGGGAGCG CTTCGCGGCC TGGCTGGACA AGCTGAAGCA GGCCGGGCTG ATCGCCTTCG ACAGCGAGAC CACCAGCCTG GACGCCCAGC AGGCGGAACT GGTCGGCCTG TCCTTTGCCG TCGCACCCTT GGAGGCGGCC TACATCCCGC TGGCGCACAG CTACATGGGC GTGCCCGACC AGCTCGACCG CGACGCGGTG CTCGCCGCGC TCCGGCCGAT CCTGGAAGAC CCGGCCAGGG CCAAGGTCGG CCAGCACGCC AAGTACGACA TGAACGTGCT GGCGCATTAC GGCATCGAGG TGCGCGGCGT GGCCTACGAC ACCATGCTGG AGTCCTACGT GCTGGATTCC ACGGCGACCC GCCACGACAT GGACAGCCTG GCGCTGAAAT ACCTCGGCCA GGGCACCATC CGCTTCGAGG ACATCGCCGG CAAGGGCGCC AAGCAACTGA CCTTCGACCA GATCGCCCTG GAGCAGGCCG GCCCCTACGC CGCCGAGGAC GCCGACGTGA CCCTGCGCCT GCACCAGTGC CTGTGGCAGA AGCTCGAGGC CATCCCGGCG CTGGCGAAGG TGCTGAAAGA GATCGAGATG CCGCTGGTCC CGGTGCTGGC GCGCATCGAG CGGCACGGCG CGCTGGTCGA CGCCAAGCTG CTCGGCGAGC AGAGCCTGGA ACTGGGCGAG AAGCTGCAGC AGTTGGAGCG CGAGGCCCAC GAGTTGGCCG GCGAACCCTT CAACCTCGCC TCGCCCAAGC AGCTCGGCGC CATCCTCTAC GACAAGCTGG GCCTGCCGGT GCTCTCCAAG ACCGCCAAGG GCCAGCCGTC CACCGCCGAG AGCGTGCTCG CCGACCTGGC CGAGCAGGGC TATCCGCTGC CCCAGGTGAT CATGCGCCAC CGCAGTCTGA ACAAGCTCAA GGGCACCTAC ACCGACAAAC TGCCGCAGCA GATCAACCCA CGCACCGGGC GCATCCACAC CAGCTACCAC CAGGCGGTGA CCGCCACCGG GCGGCTGTCC TCCTCGGACC CGAACCTGCA GAACATCCCG ATCCGCACCA CCGAGGGCCG ACGCATCCGC CAGGCCTTCG TCGCGCCCGA GGGCTACCGG CTGGTGGCCG CCGACTATTC GCAGATCGAA CTGCGCATCA TGGCCCACCT GGCCCAGGAT ACGAGCCTCT TGCACGCCTT CCGGAACGAC CTGGACGTGC ACCGGGCGAC CGCCGCCGAG GTGTTCGGCG TCGCCCCGGA AGCGGTCAGC GCGGACCAGC GGCGCAGCGC CAAGGCGATC AACTTCGGGC TGATCTACGG CATGAGCGCC TTCGGCCTGG CCAGGCAGAT CGGCGTCGAG CGCAAGGAGG CGCAGGCCTA CATCGACCGC TACTTCGCTC GCTATCCCGG CGTGCTCGCC TACATGGAGC GCACCCGGGC GCAGGCGGCC GAACAGGGCT ACGTGGAGAC CCTGTTCGGT CGCCGCCTGT ACCTGCCGGA GATCCACTCG AAGAACGGCG CCTTGCGCAA GGCCGCCGAG CGCACCGCGA TCAACGCACC GATGCAGGGC AGCGCGGCGG ACATCATCAA GCGCGCCATG GTGACGGTGG ATGCCTGGCT GCTGGAGAGC GGGCTGGATG CGCGGATGAT TCTGCAGGTG CACGACGAAC TGGTGCTGGA GGTGCGCGAG GATCAGGTCG AGGCGCTCAA GGCCGGCCTG CTGCCGCGCA TGAGCGGCGC CGCCGCGCTG GACGTGCCGC TGCTGGTCGA GGCCGGGGTC GGCGGCAACT GGGACGAGGC GCACTGA
|
Protein sequence | MTQTPLVLVD GSSYLYRAFH ALPPLATSGG TPTGAVKGVL NMLLALRKQY PDSPFAVVFD AKGPTFRDEL FEHYKSHRPP MPDDLRVQIE PLHACVRALG LPLLCVEGVE ADDVIGTLAR GSALDGRPVV ISTGDKDMAQ LVDGHITLVN TMTGSVLDHA GVAEKFGVGP ELIIDYLALM GDKVDNIPGV PGVGEKTALG LLQGLGGGLD TIYANLERVA TLPIRGAKSL AAKLEEHREM AYLSHRLATI KTDVALDIEI DALHPAAPDR EALIALYREL EFKSWLDELL REAHAAGPTA DPAAPGTHYE TLLDRERFAA WLDKLKQAGL IAFDSETTSL DAQQAELVGL SFAVAPLEAA YIPLAHSYMG VPDQLDRDAV LAALRPILED PARAKVGQHA KYDMNVLAHY GIEVRGVAYD TMLESYVLDS TATRHDMDSL ALKYLGQGTI RFEDIAGKGA KQLTFDQIAL EQAGPYAAED ADVTLRLHQC LWQKLEAIPA LAKVLKEIEM PLVPVLARIE RHGALVDAKL LGEQSLELGE KLQQLEREAH ELAGEPFNLA SPKQLGAILY DKLGLPVLSK TAKGQPSTAE SVLADLAEQG YPLPQVIMRH RSLNKLKGTY TDKLPQQINP RTGRIHTSYH QAVTATGRLS SSDPNLQNIP IRTTEGRRIR QAFVAPEGYR LVAADYSQIE LRIMAHLAQD TSLLHAFRND LDVHRATAAE VFGVAPEAVS ADQRRSAKAI NFGLIYGMSA FGLARQIGVE RKEAQAYIDR YFARYPGVLA YMERTRAQAA EQGYVETLFG RRLYLPEIHS KNGALRKAAE RTAINAPMQG SAADIIKRAM VTVDAWLLES GLDARMILQV HDELVLEVRE DQVEALKAGL LPRMSGAAAL DVPLLVEAGV GGNWDEAH
|
| |