Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | LGAS_1421 |
Symbol | |
ID | 4439367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Lactobacillus gasseri ATCC 33323 |
Kingdom | Bacteria |
Replicon accession | NC_008530 |
Strand | - |
Start bp | 1412802 |
End bp | 1415492 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 639673253 |
Product | DNA polymerase I |
Protein accession | YP_815220 |
Protein GI | 116630048 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0115482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.000000064495 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGACAGTTT TTTTATTTAA GGAGAAAAAG ATGGCACAAA AGAAATTACT TTTAATTGAC GGTAACTCTG TGGCCTTTCG TGCCTTTTAT GCTTTATACC GTCAGCTTGA TCGCTTTACT AGTCCAGATG GGTTACATAC AAATGCAATT TTTACTTTTA AGAATATGCT TGATGCGATT ATGAAGCAGA CTGATCCAAG CAATGTATTA GTTGCGTTTG ATGCAGGAAA AGTAACTTTT AGAACTAAGA TGTATCAGGA TTATAAAGGC GGTCGACAAA AAACTCCAAG TGAATTATCA GAACAATTGC CTGTAATTCG TGAAATGCTT AAAGATTTAG GAATAAAGAG CTATGAGCTA AAAAATTATG AAGCAGACGA TATTATTGGC ACACTTTCTA AAATGGGGGA AGAAGCAGGC TACACTGTTG ATATTGTAAC TGGGGATCGT GATTTAACTC AGCTTGCTTC TGATAAAACC ACGGTTTTAA TAACTAAAAA TGGAGTTGGT GATACTGAAG CATATACGCC AGAACATATG AAAGAAGTTA ATGGCGTAAC GCCAACTGAA TTTATTGATA TGAAAGCGCT GATGGGAGAC AATTCAGATA ACTATCCAGG TGTAACTAAA GTTGGTCCGA AAACAGCTTC ACGTTTAATT CAAAAATATG GCTCTGTTGA AAAACTTTAT GAACATGTGG ATGAAATGAA AAAATCCAAG TTAAAAGAGA ACCTAATTAA TGATAAAGAT AAGGCAATTT TAGCTAAAAA GTTAGCAACA ATTGATCGTG ATTCTCCAGT AGAAGTGACA CTTGCTGATA CTAAGCTAGA AGAGCCTAAT ATTGAGGACT TACGTAATTT ATATGAAAGA TTAGGATTTA AAAAATTCTT AGCTGAGTTA GGTGCAAGTG GAGTGAGTGC CGGTAAGCAA GAAAGTGAGA AGTACGAGTA TCTAGAATTA ACAAGAGAAA ATATCGCTGA CTTAGATAAA ATTAATGAAA AAGAAGTGAC ATTTTATCTG GCAATGTTGG GTGATAACTA CCATCTAGCT CCGCTTGAAG GATTTTCACT AAAGGTTAGT GATAAAATCT ATGTTTCTAA AGATGTAGTC TTACTGCAAG AAGCTCCACT TCGTCAAATG CTAGAAGATA AGAAGATTAA GAAAAATGTT TTCGACATTA AAAGAACCTA TGTAGGTTTA CATCGACTAG ATATTGATGC AGAAGGTCTA GATTATGACA TGCTCCTGGC TTCTTATTTA GTTAATAATG AAAATAATTC GAATGATCTT GGCGAAGTAG CGCATTTGTA TGATGATTAT TCAGTAAAAA CTGACTTGGA AGTTTATGGT AAAGGTAAAA AGCAAGCTGT ACCTGAAGAT GATGAGTTCT TTGAACATTT AGCAGCTAAA GTTGCTGTAA TTGAGAAGTT AAAGCAGCCA CTTTTAGAGA AATTAAAAGA TCACGAGCAA GATGACTTGT ATGAAACAAT TGAAATTCCA GTTGCTTTTG TCTTAGCTAA AATGGAAATT ACTGGGATTA AGGTTGAAGC ATCGGTTTTA AATCAATTAG GCAATGATTT TGCAGTTAAA TTACAAGAAT TAGAACATAA GATTTATCAA CAAGCCGGCG AAGAATTTAA TTTAAATTCA CCAAAACAGT TAGGACATAT TCTTTTCGAA AAATTAAACT TACCGCCAAT TAAAAAGACC AAAACTGGCT ATTCAACTTC TGTTGAAGTA TTAGAGCAAT TGAAGATGAA GAGCCCGATT GTTTCAGAAA TTTTGGATTA TCGTCAAATT GCTAAAATTC AAAATACTTA TGTTAAAGGA TTACTTGAGT GTATTCAGCC TGATAGCAGA ATCCATACCC GTTATTTACA AACATTAACT GCAACGGGGC GTCTTTCATC GGTTGATCCT AATTTACAAA ATATTCCAAC TAGAACTGAT GAGGGAAAAC AGATTAGAAA AGCTTTCGTG CCTTCAACTA AAGACGGCTA TATCTTTTCT TGTGACTACT CACAAGTTGA ATTAAGAGTT TTGGCACACG TTTCTGGGGA TGAACATATG CAGGAAGCAT TTAAGTCTGG TTATGATATT CACGCTCACA CTGCAATGAA GATTTTCCAT TTGGATTCAC CTGATGAAGT AACGCCATTA ATGCGTCGGC ATGCTAAGGC AGTCAACTTC GGAATAGTTT ACGGTATTTC TGATTATGGT TTGTCTAAGA ACTTAGGCAT TAGTCGTAAG CAAGCAAAGA CATTTATTGA TAATTACTTT GAGCAATATC CGCAAATTAA AGATTATATG GATAAGGCAA TTAAGAAAGC TCGAGAAAAC GGCTATGCAG AAACTATTAT GCATAGAAGA CGCTACTTGC CAGATATTCA TTCAAAGAAC TTTAATGTTA GAAGCTTTGC GGAAAGAACT GCAATTAATT CTCCTATTCA AGGTTCAGCT GCTGATATTA TTAAGATTGC TATGATTAAT ATGCAAAAGA AACTTGATGA ATTACATTTA AAGACTAAAA TGGTTCTACA AGTACACGAT GAACTTATTT TTGATGTACC AAAGGATGAA TTAGATACAA TTAAAAAGAT TGTGCCAGAA GTTATGCAGT CAGCTGTAAA ACTAGATGTT CCACTAATTG CTGACTCTAA CTGGGGCCAT AATTGGTATG ATGCTAAGTA A
|
Protein sequence | MTVFLFKEKK MAQKKLLLID GNSVAFRAFY ALYRQLDRFT SPDGLHTNAI FTFKNMLDAI MKQTDPSNVL VAFDAGKVTF RTKMYQDYKG GRQKTPSELS EQLPVIREML KDLGIKSYEL KNYEADDIIG TLSKMGEEAG YTVDIVTGDR DLTQLASDKT TVLITKNGVG DTEAYTPEHM KEVNGVTPTE FIDMKALMGD NSDNYPGVTK VGPKTASRLI QKYGSVEKLY EHVDEMKKSK LKENLINDKD KAILAKKLAT IDRDSPVEVT LADTKLEEPN IEDLRNLYER LGFKKFLAEL GASGVSAGKQ ESEKYEYLEL TRENIADLDK INEKEVTFYL AMLGDNYHLA PLEGFSLKVS DKIYVSKDVV LLQEAPLRQM LEDKKIKKNV FDIKRTYVGL HRLDIDAEGL DYDMLLASYL VNNENNSNDL GEVAHLYDDY SVKTDLEVYG KGKKQAVPED DEFFEHLAAK VAVIEKLKQP LLEKLKDHEQ DDLYETIEIP VAFVLAKMEI TGIKVEASVL NQLGNDFAVK LQELEHKIYQ QAGEEFNLNS PKQLGHILFE KLNLPPIKKT KTGYSTSVEV LEQLKMKSPI VSEILDYRQI AKIQNTYVKG LLECIQPDSR IHTRYLQTLT ATGRLSSVDP NLQNIPTRTD EGKQIRKAFV PSTKDGYIFS CDYSQVELRV LAHVSGDEHM QEAFKSGYDI HAHTAMKIFH LDSPDEVTPL MRRHAKAVNF GIVYGISDYG LSKNLGISRK QAKTFIDNYF EQYPQIKDYM DKAIKKAREN GYAETIMHRR RYLPDIHSKN FNVRSFAERT AINSPIQGSA ADIIKIAMIN MQKKLDELHL KTKMVLQVHD ELIFDVPKDE LDTIKKIVPE VMQSAVKLDV PLIADSNWGH NWYDAK
|
| |