Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_4102 |
Symbol | |
ID | 5110669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4459400 |
End bp | 4462192 |
Gene Length | 2793 bp |
Protein Length | 930 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640494326 |
Product | DNA polymerase I |
Protein accession | YP_001178807 |
Protein GI | 146313733 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000162392 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.404063 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCCCAGAAAA CCCTCTTATT CTCGTCGATG GCTCTTCTTA CCTTTATCGG GCGTATCATG CGTTTCCTCC TCTGACCAAT AGCGCGGGGT TGCCGACCGG CGCGATGTAC GGCGTGCTGA ACATGCTGCG CAGCCTCATC CTTCAGTATC AGCCATCGCA TGCCGCGGTC GTTTTTGACG CGAAGGGCAA AACGTTCCGC GATGAACTGT TCGAGCATTA CAAATCCCAT CGTCCACCTA TGCCGGACGA TTTGCGTGCG CAAATTGAGC CTCTGCATGC CATGGTGAAA GCAATGGGCT TGCCGTTGCT GGCCGTTTCT GGCGTTGAGG CTGATGACGT CATTGGCACG CTGGCGCGTG AAGCTGAAAA GTTAGGCCGT CCGGTGCTAA TCAGCACGGG TGATAAAGAC ATGGCTCAAC TCGTCACCCC TGGTATCACC CTAATCAATA CCATGACAAA TACCATTCTT GGGCCAGATG AAGTCGCCAC CAAATATGGC GTGCCTCCAG AACTGATTAT CGATTTCCTG GCCTTGATGG GCGACTCCTC AGATAACATC CCAGGCGTGC CGGGTGTAGG CGAAAAAACC GCGCAAGCGC TGCTTCAAGG GTTAGGTGGG CTGGACACCC TGTATGCTGA ATCAGATAAA ATCGCTGGCT TAACATTCCG TGGCGCGAAA ACGATAGCCG CGAAGCTGGA ACAGAACAAA GAGGTGGCCT ACCTCTCCTA CAAACTGGCG ACCATCAAAA CCGACGTTGA ACTGGAACTG ACCTGTGAAC AGCTTGAGGT GCAGCAGCCT ATTGCTGAAG AGCTGCTTGG CCTGTTTAAG CAATATGAAT TCAAGCGCTG GATAACCGAT GTCGAAGCCG GCAAATGGAT GCAAGCTAAG GGCAGTAAAC CTGCCGCTAA GCCAAAAGAC ACCATCGTCG TCGATGCTGA AGACGAAGTG GAAGAAGAGG CGACGACGCT CTCTTATGAT AACTATGAAG TGGTGCTTGA AGAGTCTCAG CTCATTGCCT GGGTAGAAAA ACTGAAAAAA GCGCCCGTTT TTGCTTTTGA TACCGAAACC GACAGCCTTG ATAACATCTC TGCCAATATG GTGGGTCTGT CATTTGCGAC GGAGCCTGGC ATGGCGGCTT ACGTTCCTGT CGCGCATGAC TATCTTGATG CCCCTAATCA GATCTCCCGT GAGCGCGTAC TGGAATTACT GAAACCGCTC CTCGAGGATG AGAAAGCCAA AAAAGTTGGG CAAAACCTCA AGTTTGACCG CGGCATTTTG CAAAATTACG GCATTGAGCT GCGCGGTATT GCCTATGACA CCATGCTGGA ATCGTACATT CTGAACAGTG TCGCGGGCCG CCATGATATG GATTCGCTTT CAGATCGCTG GCTTAAGCAT AAAACCGTCA CCTTTGAAGA GATCGCCGGT AAAGGCAAGA ATCAACTGAC GTTTAATCAG ATCGCGCTCG AAGAAGCGGG CCGTTACGCG GCGGAAGACG CCGACGTCAC GCTGCAACTG CATCTGAAAA TGTGGCCAAA ACTGCAAAAG CAGGAAGGCC CGCTGAATGT ATTTGAGCAT ATTGAAATGC CGCTGGTGCC CGTTATTTCG CGCATTGAGC GCAATGGCGT CAAAATTGAT CCGGCGGTGT TGCACAAACA TTCTGAGGAG CTTGCCCTAC GATTGAGTGA GCTTGAGCAA AAAGCCCATG AGATTGCCGG AGAGCCGTTC AACCTATCCT CCACCAAGCA GTTGCAGACT ATCTTGTTTG AAAAGCAAGG TATCAAGCCG CTGAAGAAAA CGCCTGGTGG TGCACCATCA ACGTCTGAAG AGGTACTGGA AGAGCTGGCG CTCGATTACC CATTACCAAA AGTGATTCTG CAATACCGTG GTCTCGCCAA GCTTAAATCG ACTTATACCG ATAAGCTGCC GTTGATGATT AACCCAAAAA CCAGTCGCGT TCACACCTCT TATCATCAGG CGGTTGCGGC GACAGGGCGT TTGTCCTCTA CTGAGCCAAA CCTGCAGAAC ATCCCGGTTC GTAATGAAGA GGGGCGCAGA ATCCGTCAGG CATTTATCGC GCCGAAAGAT TATCTGATTG TATCTGCCGA CTACTCGCAA ATTGAACTGC GCATCATGGC GCATTTGTCG CGGGATAAAG GCTTACTGAC CGCATTTGCC GAAGGCAAAG ATATCCATCG CGCGACTGCA GCAGAAGTTT TTGGTTTGCC ACTGGATAGC GTGACCAACG ACCAGCGCCG TAGTGCGAAA GCGATCAATT TTGGTCTGAT TTACGGCATG AGCGCGTTTG GCCTTTCTCG CCAGCTAAAT ATTCCGCGCA AAGAGTCGCA AAAATACATG GACCTCTACT TTGAGCGTTA TCCAGGCGTT CTGGAATACA TGGAACGCAC GCGCGCGCAG GCGAAGGAAA AAGGTTACGT TGAAACGCTG GATGGTCGTC GTCTTTATCT CCCTGACATC ACGTCAAGCA ATGCGGCTCG TCGGGCTGGG GCAGAGCGAG CGGCGATCAA CGCACCGATG CAGGGTACGG CAGCCGACAT CATCAAACGT GCAATGATTG CAGTAGATGC GTGGCTTGAA AAAGACAAAC CGCGCGTGCG CATGATCATG CAGGTACACG ATGAACTGGT GTTCGAAGTT CATCATGAAG ATCTGGAAGC CGTTTCTAAA AAGATCCATG AACTGATGGA AGGCAGCATG ACATTAGATG TGCCGTTGCT GGTGGAAGTA GGAAGCGGTG AAAACTGGGA CTTGGCTCAC TAA
|
Protein sequence | MVQIPENPLI LVDGSSYLYR AYHAFPPLTN SAGLPTGAMY GVLNMLRSLI LQYQPSHAAV VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT LAREAEKLGR PVLISTGDKD MAQLVTPGIT LINTMTNTIL GPDEVATKYG VPPELIIDFL ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAESDK IAGLTFRGAK TIAAKLEQNK EVAYLSYKLA TIKTDVELEL TCEQLEVQQP IAEELLGLFK QYEFKRWITD VEAGKWMQAK GSKPAAKPKD TIVVDAEDEV EEEATTLSYD NYEVVLEESQ LIAWVEKLKK APVFAFDTET DSLDNISANM VGLSFATEPG MAAYVPVAHD YLDAPNQISR ERVLELLKPL LEDEKAKKVG QNLKFDRGIL QNYGIELRGI AYDTMLESYI LNSVAGRHDM DSLSDRWLKH KTVTFEEIAG KGKNQLTFNQ IALEEAGRYA AEDADVTLQL HLKMWPKLQK QEGPLNVFEH IEMPLVPVIS RIERNGVKID PAVLHKHSEE LALRLSELEQ KAHEIAGEPF NLSSTKQLQT ILFEKQGIKP LKKTPGGAPS TSEEVLEELA LDYPLPKVIL QYRGLAKLKS TYTDKLPLMI NPKTSRVHTS YHQAVAATGR LSSTEPNLQN IPVRNEEGRR IRQAFIAPKD YLIVSADYSQ IELRIMAHLS RDKGLLTAFA EGKDIHRATA AEVFGLPLDS VTNDQRRSAK AINFGLIYGM SAFGLSRQLN IPRKESQKYM DLYFERYPGV LEYMERTRAQ AKEKGYVETL DGRRLYLPDI TSSNAARRAG AERAAINAPM QGTAADIIKR AMIAVDAWLE KDKPRVRMIM QVHDELVFEV HHEDLEAVSK KIHELMEGSM TLDVPLLVEV GSGENWDLAH
|
| |