Gene Information |
Locus tag | VC0395_A2409 |
Symbol | polI |
ID | 5137879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2563376 |
End bp | 2566180 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640533862 |
Product | DNA polymerase I |
Protein accession | YP_001218310 |
Protein GI | 147675220 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
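The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2563376
end_bp = 2566180
gene_length_bp = 2805
protein_length_aa = 934


def span_length(start, end):
    """Length of a span given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both computed values agree with the record: 2805 bp and 934 aa.
print(computed_length, computed_protein)
```

Under these assumptions the listed Gene Length and Protein Length are consistent with the listed coordinates.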
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000078353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCATA TACCTGACAA TCCATTGATT TTGATCGACG GCTCCTCTTA CCTTTATCGC GCATTTCATG CTTACCCGGG CACCATGAGT AATGGTGAAA TCCCAACCAA CGCCATCTAT GGTGTGGTAA ACATGATTCG CAGCATGATG CGCCAGTTCG CTTCGGATCG CATGGCGGTG ATTTTTGATG CCAAGGGAAA AACCTTCCGT GATGAGATGT ACGATCAATA CAAGGCTCAT CGTCCTCCTA TGCCAGATGA GTTGCGCTGT CAGGTTGAAC CTTTACACCA AGTTATTCGC GCTATGGGTT TACCGCTGTT GGCGATTGAA GGTGTAGAAG CTGACGATGT GATTGGTACG TTGGCGCGCC AAGCCTCACA AGCTGGTATG CCTGTTTTGA TCAGTACCGG CGATAAAGAT ATGGCGCAGT TGGTGGATGA TAATATTACC CTGATCAATA CCATGACCAA CGTCGTGCTG GATCGCGAAG GTGTGATTGA AAAATTCGGT ATCCCGCCAG AGCTTATTAT CGACTACCTC GCCTTGATGG GTGATAAAGT GGATAACATT CCGGGCGTAC CTGGGGTGGG TGAAAAAACC GCGACCGCAC TTTTGCAAGG GATTGGCGGC TTAGAAGCGC TGTACGCCAA CCTCGATAAA ATCGCTGCTT TAGGCTTCCG TGGCTCAAAA ACCATGGCAC AAAAGCTGGA AGAGAACCGC GGTAATGCCA AGCTTTCTTA TCAACTCGCC ACCATCAAAT GTGATGTCGA GCTTGAGGAA TCTCCGCAAA CCCTGCTAAA GCAAACGCCA GATCGAGATG CGTTGATGTC GCTCTACGGT AAGCTCACCT TTAAATCTTG GCTGACCGAA TTGCTTGATG GTGGTACAGG TATTGTGACG GCGGACGAGC AAACGAAAAC CTCTTCAGTT ACTGTTTCAA CCGCAGCAAC TCATGCGGCT GCGATTCCTG AAAGCCCTGC GGCCCACATC GATCGCAGCC AGTATCAAAC CATCCTAAAC GAACAAGACT TCCAGCTTTG GTTAGAGAAG TTAAAACAAG CCGAGCTGTT TGCCTTTGAT ACGGAAACCG ACAACCTTGA TTACATGGTC GCCAATTTGG TGGGGATGTC GTTTGCAGTT GCCGAAGGTG AAGCGGCTTA TCTGCCAGTT GCGCACGATT ATCTGGATGC GCCGCAGCAG TTGGAGCGCG ACTGGGTGAT TGCCCAGCTA AAACCTCTGC TTGAAGATGA AAGCAAAGCC AAAGTGGGCC AAAACCTCAA ATACGATGCG AGCGTGATGG CGCGCTACGG CGTTGAGCTG CGTGGTATTC GCCATGACAC TATGCTGCAA TCCTACGTCT ATAACAGCGT GGGCGGCAAA CATGATATGG ATAGTTTAGC GCTGCGTTTT TTACAGCACA GCTGTATCTC GTTTGAACAA GTGGCAGGTA AGGGTAAGAA CCAGCTCACC TTTAACCAAA TCGCTTTAGA AGAAGCGGCA CAATACGCGG CAGAAGATGC GGATGTGACG TTGCGTCTGC ATCAGCGAAT TCATCCGTTG ATTGAGCAAG ACGCTAAGCT TGAGCAAGTG TATCGCGAAA TCGAAATGCC GCTGGTACCT GTGCTATCAC GTATTGAACG TACGGGCGTG ATGATCGACG ACATGCTGCT CAGTGCCCAA TCGCAAGAAA TTGCGTTACG TTTAGATCAA TTAGAACAAA ATGCTTATGA GTTGGCAGGT CAGCCGTTCA ACTTAAGCTC GCCGAAGCAG CTACAAACCA TTTTGTTTGA GCAGATGAAG 
CTGCCTGTCT TGCAAAAAAC GCCATCCGGT ACCCCTTCAA CCAATGAAGA AGTGTTGCAA GAGCTGGCTT TGGATTACCC ACTGCCCAAG GTTCTGATTG AGTACCGCGG TCTGGCCAAA CTCAAATCGA CCTATACCGA TAAGTTGCCG AAGATGATTA ACCCAAGTAC AGGGCGCGTG CATACTTCCT ATCATCAAGC AGTGACGGCC ACTGGGCGTC TCTCCTCCAC TGATCCTAAC TTGCAGAACA TTCCAGTGCG TAATGAAGAA GGACGCCGTA TTCGCCAAGC GTTTGTTGCT CCGCATGGTT GGAAAATCAT GGCGGTCGAC TACTCGCAAA TCGAGCTGCG CATCATGGCG CACCTGTCGG GCGATCAGGC GTTATTGGAT GCTTTCCGTG ATGGGAAAGA TATCCATGCG GCGACAGCAG CAGAAATCAT TGGTGTGCCG ATTGATCAAG TGAGCAGTGA GCAGCGTCGC CGAGCGAAAG CGGTGAACTT TGGTTTGATC TACGGCATGA GTGCGTTTGG TCTGGCAAAA CAGCTCGGTA TTCCACGCGG TGAAGCACAA GAGTATATGG ATAAATACTT TGAGCGTTAT CCCGGCGTGA TGCAGTACAT GGAAGATACG CGTAGCCGTG CTGCACAATT GGGTTATGTC GAAACCATCT TCGGTCGTCG CTTACATTTG CCGGAAATCA CCTCACGTAA CGCTATGCGT CGCAAAGCGG CTGAGCGGGC AGCGATCAAC GCACCGATGC AAGGCACCGC GGCAGACATC ATCAAAAAAG CCATGTTGTT GGTGGATGAG TGGATTGAGC GGGAAGGTGA TGGCCGAGTC AAATTGCTGA TGCAAGTACA CGATGAATTG GTCTTTGAAG TTAAAGAGTC ATCTTTATCC GAAATTGAAA GTAAAGTACA ACAGCTGATG GAGTCAGCGG CCGAGCTTGC AGTACCTTTA GTGGCCGAAG CCGGCCACGG CGACAACTGG GAGCAGGCGC ACTAG
|
Protein sequence | MAHIPDNPLI LIDGSSYLYR AFHAYPGTMS NGEIPTNAIY GVVNMIRSMM RQFASDRMAV IFDAKGKTFR DEMYDQYKAH RPPMPDELRC QVEPLHQVIR AMGLPLLAIE GVEADDVIGT LARQASQAGM PVLISTGDKD MAQLVDDNIT LINTMTNVVL DREGVIEKFG IPPELIIDYL ALMGDKVDNI PGVPGVGEKT ATALLQGIGG LEALYANLDK IAALGFRGSK TMAQKLEENR GNAKLSYQLA TIKCDVELEE SPQTLLKQTP DRDALMSLYG KLTFKSWLTE LLDGGTGIVT ADEQTKTSSV TVSTAATHAA AIPESPAAHI DRSQYQTILN EQDFQLWLEK LKQAELFAFD TETDNLDYMV ANLVGMSFAV AEGEAAYLPV AHDYLDAPQQ LERDWVIAQL KPLLEDESKA KVGQNLKYDA SVMARYGVEL RGIRHDTMLQ SYVYNSVGGK HDMDSLALRF LQHSCISFEQ VAGKGKNQLT FNQIALEEAA QYAAEDADVT LRLHQRIHPL IEQDAKLEQV YREIEMPLVP VLSRIERTGV MIDDMLLSAQ SQEIALRLDQ LEQNAYELAG QPFNLSSPKQ LQTILFEQMK LPVLQKTPSG TPSTNEEVLQ ELALDYPLPK VLIEYRGLAK LKSTYTDKLP KMINPSTGRV HTSYHQAVTA TGRLSSTDPN LQNIPVRNEE GRRIRQAFVA PHGWKIMAVD YSQIELRIMA HLSGDQALLD AFRDGKDIHA ATAAEIIGVP IDQVSSEQRR RAKAVNFGLI YGMSAFGLAK QLGIPRGEAQ EYMDKYFERY PGVMQYMEDT RSRAAQLGYV ETIFGRRLHL PEITSRNAMR RKAAERAAIN APMQGTAADI IKKAMLLVDE WIEREGDGRV KLLMQVHDEL VFEVKESSLS EIESKVQQLM ESAAELAVPL VAEAGHGDNW EQAH
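The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes that the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG even though the gene is on the minus strand) and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The helper names are hypothetical, and only the first 30 and last 15 bases of the record are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, order TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 bases of the gene sequence, copied from the record.
gene_start = "ATGGCTCATA TACCTGACAA TCCATTGATT"
gene_end = "GAGCAGGCGC ACTAG"

print(translate(gene_start))  # MAHIPDNPLI, the first block of the protein sequence
print(translate(gene_end))    # EQAH, the last four residues of the protein sequence
```

Translating the full gene sequence works the same way once the two sequence lines are joined and passed through `clean`.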
|
| |