Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0087 |
Symbol | |
ID | 7316093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 91031 |
End bp | 93742 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643614977 |
Product | DNA polymerase I |
Protein accession | YP_002512178 |
Protein GI | 220933279 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCAAGA CCCCGCCCCT GGTGCTGGTG GATGGATCCT CCTACCTGTA TCGCGCCTTC CACGCCCTGC CGCCCCTGAC CAATTCCCGG GGTGAACCCA CCGGGGCCGT CTATGGTGTG GCCAACATGC TGCGCAAGCT GCTCAAGGAC TACGCGCCGG AGCACGTGGT GGTGGTGTTC GATGCCAAGG GCAAGACCTT CCGGGACGAG ATGTACGCCG AGTACAAGGC CAACCGCCCG CCCATGCCCG ACGAGCTGGC CGCCCAGGTG GAGCCCCTGC ACCAGGTGGT CCAGGCCATG GGCCTGCCCA TGCTGGTGGT CCCCGGGGTG GAGGCGGACG ACGTGATCGC GACCCTGGCC CGCGAGGGCA GGGAACACGG CCTGGAGGTG GTGATCTCAA CCGGCGACAA GGACATGGCC CAGCTGGTGG AGCCCGGGGT GACCCTGGTG AACACCATGA CCGACACCAA AATGGATGCC GAGGGGGTCA AGGAGAAGTT CGGCGTTGCG CCGGAGCAGA TCGTGGATTA CCTGGCGCTG ATCGGCGACA CCGTGGACAA CGTGCCCGGG GTGGACAAGG TGGGGCCCAA GACCGCAGTG AAGTGGCTGG AGGCCTACGG CAGTCTCGAC GGCATCATGG AACACGCGGA GGAGATCAAG GGCAAGGTGG GCGAGAACCT GCGCGCGGCC CTGGGCCACC TGCCCCTGTC CCGGGAGCTG GTGACCGTGC GCCGGGACCT GGAACTGGAC GTGACGCCCG AGTCCCTGCG TCTCGGCGAG CCGGACCGGG AGACCCTCAG GGCGCTGTTC TCCCGCCTGG AATTCCGCAC CTGGCTCAAG GAACTGGAGT CGGGCGAAGA GCAAGCCCAG GCCGCATCCG ATGACCCGAC ACCCGGCCAG CCCGCCCAGG CCTACGAGAC GATCCTGGAC GAAAAGGCCC TGGCCGCCTG GATGAAGCGC CTGGAGACGG CAGAGCTGTT CGCCTTCGAC ACCGAGACCA CCAGCCTGGA CTACATGCAG GCGCAAGTGG TGGGGGTCTC CTTCGCCGTG AAGACCGGCG AGGCCGCCTA CCTGCCGCTC GCCCATGACT ACGCCGATGC GCCCCAGCAA CTGGATCGGG ACGAGACGCT GAAGCGCCTC AAGCCCCTGC TGGAGAGCGC CAGGCACAAG AAGCTCGGCC ATCACCTCAA ATATGACCGC AACGTGCTGC TCAACCACGG CATCGAGTTG AACGGCATCG AGCACGACAC CATGCTGGAG TCCTACGTGT TGAACAGCAC GGCGAGCCGT CACGACATGG ACAGCCTGGC CCAGAAGTAC CTGGACTACC GCACCACCCA CTACGAGGAA GTGGCCGGCA AGGGGGCCAA GCAGATCCCC TTCTCCCAGG TGCGCATCGA GGACGCCACC CCCTACGCCG CCGAGGACGC GGACATCACC CTGCGCCTGC ATGAACACCT CTGGCCGCAG CTCTCGGCCG CCGAGGGCCA GTGCCGGGTG TACCGGGAGA TCGAGATGCC CCTGGTGCCG GTGCTCTCGC GCATGGAGCG CACCGGGGTG AAGGTGGACG CCGAGCGCCT GTTCGCCCAG AGCCACGAGC TGGCCGAGCG CATGGGCCAG ATCGAGCGGG AGGCCCATGA GGTGGCCGGC GGGGCCTTCA ACCTGGGCTC ACCCAAGCAG ATCCAGGAGA TCCTGTACGA GCGCCAGAAG CTGCCGGTGC TCAGGAAGAC CCCCAAGGGC CAGCCCTCCA CCGCCGAGGA CGTGCTGGAG CAGCTGGCCC TGGACTATCC GCTGCCGAAG CTGATCCTGG AGCACCGCTC GCTGTCCAAG CTCAAGTCCA CCTACACCGA CAAGCTGCCG GAGCGCATCG ACCCGGACAC CGGCCGGGTG CACACCTCCT ATCACCAGGC GGTGGCCTCC ACCGGGCGCC TGTCCTCCTC GGACCCCAAC CTGCAGAACA TCCCCATCCG CACCGAGGAG GGCCGGCGCA TCCGCCAGGC CTTCGTGGCC CCTTCGGGCC GGCGGCTGGT AGCGGCGGAC TACTCCCAGA TCGAGCTGCG CATCATGGCC CACCTGTCCG GGGACGCCGG CCTGCTCAGG GCCTTCGCCG AGGGTCAGGA CATCCACCGG GCCACCGCCG CCGAGGTCTT CGGCGTGGGC CTCGACCAGG TCTCCGGCGA GCAGCGCCGG GCCGCCAAGG CCATCAACTT CGGGCTCATC TACGGCATGT CCGCCTTCGG CCTGGCCCGC CAGCTGGGCA TCGAGCGGGG CGACGCCCAG GACTACGTGG ACCGCTATTT CGCCCGCTAT CCCGGCGTGA AGGACTACAT GGAATCCACC CGGGAAAAGG CCCGGGATCT GGGCTACGTG GAGACCCTGT TCGGCCGGCG CCTGTACCTG CCGGACATCA ATGCCCGCAA CGGCCAGATC CGCGCCCAGG CGGAGCGGGT GGCCATCAAC GCCCCCATGC AGGGCACCGC CGCGGACATC ATCAAGCGGG CCATGATCCA GGTGGATCAA TGGATTCGCG AATCACGAAT ACCGGCGGTG ATGATCCTGC AGGTGCACGA CGAACTGGTG CTGGAGGTGG ACGAGGACGC GGTGGACAAG GTGCGTGAGG AGCTGTGCGT GCGCATGTCC CAGGCCGCCG AGCTCAAGGT GCCCCTGGTG GTGGAGGCCG GGGTGGGGGA TAACTGGGAT GATGCACATT AG
|
Protein sequence | MSKTPPLVLV DGSSYLYRAF HALPPLTNSR GEPTGAVYGV ANMLRKLLKD YAPEHVVVVF DAKGKTFRDE MYAEYKANRP PMPDELAAQV EPLHQVVQAM GLPMLVVPGV EADDVIATLA REGREHGLEV VISTGDKDMA QLVEPGVTLV NTMTDTKMDA EGVKEKFGVA PEQIVDYLAL IGDTVDNVPG VDKVGPKTAV KWLEAYGSLD GIMEHAEEIK GKVGENLRAA LGHLPLSREL VTVRRDLELD VTPESLRLGE PDRETLRALF SRLEFRTWLK ELESGEEQAQ AASDDPTPGQ PAQAYETILD EKALAAWMKR LETAELFAFD TETTSLDYMQ AQVVGVSFAV KTGEAAYLPL AHDYADAPQQ LDRDETLKRL KPLLESARHK KLGHHLKYDR NVLLNHGIEL NGIEHDTMLE SYVLNSTASR HDMDSLAQKY LDYRTTHYEE VAGKGAKQIP FSQVRIEDAT PYAAEDADIT LRLHEHLWPQ LSAAEGQCRV YREIEMPLVP VLSRMERTGV KVDAERLFAQ SHELAERMGQ IEREAHEVAG GAFNLGSPKQ IQEILYERQK LPVLRKTPKG QPSTAEDVLE QLALDYPLPK LILEHRSLSK LKSTYTDKLP ERIDPDTGRV HTSYHQAVAS TGRLSSSDPN LQNIPIRTEE GRRIRQAFVA PSGRRLVAAD YSQIELRIMA HLSGDAGLLR AFAEGQDIHR ATAAEVFGVG LDQVSGEQRR AAKAINFGLI YGMSAFGLAR QLGIERGDAQ DYVDRYFARY PGVKDYMEST REKARDLGYV ETLFGRRLYL PDINARNGQI RAQAERVAIN APMQGTAADI IKRAMIQVDQ WIRESRIPAV MILQVHDELV LEVDEDAVDK VREELCVRMS QAAELKVPLV VEAGVGDNWD DAH
|
| |