Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0278 |
Symbol | |
ID | 3917631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 296295 |
End bp | 299132 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640443007 |
Product | DNA polymerase I |
Protein accession | YP_495560 |
Protein GI | 87198303 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAA AGCAGCATCT CTACCTGGTC GACGGTTCGG CCTACATCTT CCGCGCCTAT CACCGCCTGC CCCCGCTGAC GAACCCGCGC GGCGTGCCTG TCGGCGCGGT CTATGGCTAT ACGACGATGC TGTGGAAACT GGCCGAGGAC CTGCACAAGG CGGACGGGCC GACGCACCTG GCGGTCATCC TCGACAAGGC GGGCACCTCG TTCCGCAACG ACCTGTACGA CCAGTACAAG GCCAATCGTC CGCCGCCGCC CGAAGATCTC GTCCCGCAGT TCCCGCTGAT CCGCGATGCC ACGCGCGCCT TCAGCCTCGC CTGCATCGAG GAGGACGCGC TCGAGGCGGA CGACCTCATC GCCTCCTACG CCCGCGCCGC CACGCTGCGC GGCTGGGACG TGACCATCGT TTCATCGGAC AAGGACCTGA TGCAGCTTGT CGGCAAGTGC GCGACCGGTG GCGGGTGCAT CGACATGCTC GACACGATGA AGAACCAGCG CATCGACATT CCCGAGGTGG TCGAGAAGTT CGGCGTGCCC CCGGAAAAGG TCGGCGACGT GCTGGCGCTG ATGGGCGATT CGGTCGACAA CGTGCCCGGC ATCCGCGGCA TCGGCCCGAA GACCGCGACC AAGCTCATCC AGGAACATGG CGATCTCGAA TCCGCGCTTG CCGCCGCGCC GACGATGAAG GCGGGCAAGC TGCGCGAAAG CCTGATCGAG CAGGCGGAAA TGGCCCGCCT CTCGCGCGTG CTGGTGGCGC TCAAGGAAGA CTGCCCGCTG CCGGTCCCGA TCGAGGACTT CAAGCTCGGC CAGATCCCGC GCGATCCGCT CGCCGCGTTT CTCGAGGAAC ACGGCTTCAC CAGCCTGCTG CGGCGGCTGG ACGACGGCAA GGGCAGCCCC GGGCCGAAGA CCCAGCTCAA CCCCGCCAAG CCCGACAACA AGGCTGCGCC CGCCGCGCCC GAAGGCAACC GCCAGGCGCT ACCCTCGTGG CCCGCGCCCG ACCTCTCGGC CTATGCCTGC GTCCAGTCGA TGGGCGAACT GCAGCAGTGG ATCGCCCGCG CCTTCGAAGC GCGGGTCGTC GCCATCGATA CCGAGACCAG CGCGCTCGAC GCGATGCGCG CCGACCTGGT GGGCGTAAGC CTCGCGCTGG GCCCCAACGA TGCGTGCTAC ATCCCGCTTG GCCACGGCGG CTCCGACATG TTCGCGGAAA AGCCCCTGCA GGTGGACCGT GCCGAGGCGC TCGCCGCGCT CAAGCCGCTG CTGGAAAGCG ACGCGGTGCT CAAGGTCGGC CAGAACATCA AGTACGACCT CAACATCCTC GCCCGGCAGG GGATCATGGT GTCACCCATC GACGACACGA TGGTGATGAG CTTCTGCCTC GATGCGGGGC GCAGCGAGAC AGGTCTTGCC GGGCACGGGA TGGACGAGCT GTCCGAGCGG CACCTGCTGC ACACGACGAT GAAGTTCAAG GAGCTGTGCG GCACCGGCAA GAAGGCGATC TCCTTTGCCG AAGTGCCGCT GGCCGACGCC ACGCGCTATG CCGCCGAGGA CGCGGACGTG ACGTGGCGCC TGCACCGCCT GTTCCAGCCG CGGCTGGCCG GGGAAGGCGG CACGCGCATC TACCAGAAGG TCGACCGGCC GCTGATCCCG GTCGTCGCGA TGATGGAACG CAACGGCATC AAGGTCGACC GCGAGCGACT GTCCGGCCTG TCGACCGAGT TCGCCGCCCA GATCGCGCGC CTCGAAGGCG TGATCCACGA GAAGTGCGGC ATGTCCTTCA CCATCGGCAG CCCCAAGCAG CTGGGCGAAG TGCTGTTCGA CAAGCTGGGC TACAAGGGCG GCAAGAAGGG CAAGACCGGG CAGTATTCGA CCGACCAGTC GGTGCTGGAA GGCCTTGCCG CGCAGGGCGC CGAAGTGGCG ACACTGGTGC TGGAATGGCG CCAGCTCTCG AAGCTCAAGT CGACCTATAC CGACGCGCTG CAGGCCGCGA TCAACCCGCA CACCGGCCGC GTCCACACCA GCTACAGCCT GGTGGGCGCG CAGACCGGCC GCCTGTCTTC AAACGAGCCT AACCTCCAGA ACATCCCGAT CCGCACCGAG ATAGGCCGCC AGATCCGCGA TGCCTTCGTC GCCGAGCCCG GAAACGTGCT GCTCGCGGCG GACTACAGCC AGATCGAACT GCGCCTTGCC GCGCACATGG CCGACGTGCC CGCGCTCAAG GAGGCGTTCG CCGCCGGCGA GGACATTCAC GCGCGTACTG CGATGGAAAT GTTCGGCACG GTCGACCGCG ATACGCGCGG ACGGGCCAAG ACGATCAACT TCGCTATCCT CTACGGCATC AGCCGGTGGG GCCTCGGCGG ACGGCTGGGC GTCAGCGCGG ACGAGGCGCA GGCGATGATC GACCGCTACT TCGAACGCTT CCCCGGCATC CAGCGCTACA TCCACGAGAC ATTGGAGAGT GTGCGCGAGC GCGGCTATTC CGAGACGCTG TTCGGCCGCA AGACCTGGTT CCCGCGCATA AGCTCGAAGA ACCAGGCCGA ACGCCAGGGC AGCGAGCGCG CTGCGATCAA CGCGCCGATC CAGGGCACCA GCGCCGACAT CATCAAGCGC GCGATGGCCC GCATGATGCC CGCGCTGACC GAAGCGGGCC TCGGCCACGT GCGCATGCTG CTGCAGGTCC ACGACGAACT CGTGTTCGAA CTGCCCCAGG GCGACGTCGA GGCGGCGAAG CCCGTGATCG AGCGCGTGAT GGCGACGGCG GCAGAACCTG CGGTCAAGCT CAGCGTGCCG CTGGGCGTCG AGATCGGCAC CGGCCTGTCG TGGGGCGCGG CGCACTGA
|
Protein sequence | MDEKQHLYLV DGSAYIFRAY HRLPPLTNPR GVPVGAVYGY TTMLWKLAED LHKADGPTHL AVILDKAGTS FRNDLYDQYK ANRPPPPEDL VPQFPLIRDA TRAFSLACIE EDALEADDLI ASYARAATLR GWDVTIVSSD KDLMQLVGKC ATGGGCIDML DTMKNQRIDI PEVVEKFGVP PEKVGDVLAL MGDSVDNVPG IRGIGPKTAT KLIQEHGDLE SALAAAPTMK AGKLRESLIE QAEMARLSRV LVALKEDCPL PVPIEDFKLG QIPRDPLAAF LEEHGFTSLL RRLDDGKGSP GPKTQLNPAK PDNKAAPAAP EGNRQALPSW PAPDLSAYAC VQSMGELQQW IARAFEARVV AIDTETSALD AMRADLVGVS LALGPNDACY IPLGHGGSDM FAEKPLQVDR AEALAALKPL LESDAVLKVG QNIKYDLNIL ARQGIMVSPI DDTMVMSFCL DAGRSETGLA GHGMDELSER HLLHTTMKFK ELCGTGKKAI SFAEVPLADA TRYAAEDADV TWRLHRLFQP RLAGEGGTRI YQKVDRPLIP VVAMMERNGI KVDRERLSGL STEFAAQIAR LEGVIHEKCG MSFTIGSPKQ LGEVLFDKLG YKGGKKGKTG QYSTDQSVLE GLAAQGAEVA TLVLEWRQLS KLKSTYTDAL QAAINPHTGR VHTSYSLVGA QTGRLSSNEP NLQNIPIRTE IGRQIRDAFV AEPGNVLLAA DYSQIELRLA AHMADVPALK EAFAAGEDIH ARTAMEMFGT VDRDTRGRAK TINFAILYGI SRWGLGGRLG VSADEAQAMI DRYFERFPGI QRYIHETLES VRERGYSETL FGRKTWFPRI SSKNQAERQG SERAAINAPI QGTSADIIKR AMARMMPALT EAGLGHVRML LQVHDELVFE LPQGDVEAAK PVIERVMATA AEPAVKLSVP LGVEIGTGLS WGAAH
|
| |