Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1770 |
Symbol | |
ID | 3918329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1864289 |
End bp | 1867936 |
Gene Length | 3648 bp |
Protein Length | 1215 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444511 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_497044 |
Protein GI | 87199787 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.183542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAGC AGGCCGCGCC TCTCGCAAGA GCGCGGCCTT TTTCCTACCT TGTACACATG CCCCATGCCG CCTTCGTCCC GCTTCGCATC TTCTCGTCCT ACACGATGCT CGACGGCGCC ATCGACCCGA AGGCTATCGC CAAGACCGCC GCCGAGCGGG GCTTTCCCGC CGCCGCCATC ACAGATCGCA ACGGGCTCTA CGGTAGCGTG GCCTATGCCA AGGCGTGCAA GGACATGGGC GTCCAGCCGG TCATCGGCAC GATGCTCGCC GTAGCCCGTC CCGAACGCGA CGGCGCGGCA ACCGGCTTCG GTCCGGCCGC ACCGACGATC GACTGGCTGG CCCTCTATGC GCAGGATGCC AGGGGCTATG ACAACCTTTG CCACCTCGTC AGCCGCGCCC ATCTCGACCG GCCGCTCGAA TTTGCGCCGC ACGTCGTCCT GGCGGATCTC GTCGGCCATA CCGACGGCCT CATCTGCCTT ACCGCCGCCA GCGAAGGCGC ACTGGCGCAG CTTCTTGCCG GAGGCCAGCA AAGCGCCGCC GAGGCCTATG TCGACCGGCT GCTCGAACTG TTCCCCGAAC GGCTCTACAT CGAGATAGCC CGGCGCAACG ATGCGGTGGA GGAAGCGTCC GAAACCGCGC TCATCGATCT CGCATACGCG CGGGACCTGC CGCTGGTTGC CACGAACCCC GCCTGCTTTG CAGAGCGGAC TTTCTACGAT GCGCACGATG CCATGCTCTG CATCGCCAGT TCCACCCATG TCGATAGCGC GGATCGCCCG CGCTCCAGCA AGGAATGGTG GATCAAGCCC GCGGCGGTCA TGGAGGAACT GTTCAAGGAC CTGCCCGAAG CGCTCGCCAA CACGCTGGTC GTGGCGCAGC GCTGCGCGGT CATGCCGCCC AAGCGCAAGC CGATCCTGCC CAGCCTTGCC GGCGACCAGG AAGGCGAGGC GCGGATGTGC GCCGAGGATA GCCGCAAGGG CCTCGTCCTG CGACTTGAAC CCTATTACCC GGAAAGCACC CACGCCGAAC TCGCTCGCGT GCTCTGCCTT GGCCCCGATG CCGAACCGGT CGCCGCCGAC CATCCGCAAC TGGTCGAAGC GGGCGTCTGG GAAGAAGTTC TCGACTACCG CAAGCGGCTC GAGTTCGAGA TCGCGATCAT CAACCGCATG GGGTTCGGCG GCTACTTCCT GATCGTTGCC GACTTTATCA AGTGGGCGAA GGAACACGGC ATTCCGGTCG GGCCGGGCCG CGGTTCGGGC GCGGGTTCGC TCGTCGCTTG GGCGCTTACC ATCACCGACC TCGATCCGAT CAAGCTTGGT CTTCTGTTCG AACGCTTCCT GAATCCGGAA CGCGTCTCGA TGCCGGACTT CGACATCGAC TTCTGCGAAA CCCGCCGTGG CGAGGTGATC CGCTACGTCC AGCGCAAGTA CGGCGAGGAC CACGTCGCCC AGATCATCAC CTTCGGCAAG CTCAAGGCCC GCGCGGTGCT GCGCGACACC GGCCGTATCC TTCAGATGAG CTATGGCCAG ACTGACCGGC TCTGCAAGAT GGTCCCGAAC CATCCGACCG ATCCCTGGCC GCTGCCCCGC GCCCTCAATG GCGTGGCCGA ACTCAAGCGC GAATACGACC GCGACCCCGA GGTCAAGCGC CTGATCGACC TTGCGATGCA GCTCGAAGGC CTGCCGCGCA ACAGTTCGAC CCACGCGGCG GGCGTCGTTA TCGGCGACCG CCCGCTGGCG CAACTCGTGC CGCTCTATCG CGATCCGCGT TCGGACATGC CGGTCACGCA GTTCGACATG AAGCACGTCG AGGATGCCGG CCTCGTCAAG TTCGACTTCC TCGGCCTCAA GACGCTGTCG GTCCTGCGCA AGGCCGTAGA CCTGATGAAG AAGCGCAGCA TCGAGATCGA CCTCTCGGCG CTGCCGTGGG ACGACGAGCA AACCTACAAG CTGCTCCAGT CGGGCGACAC CGTCGGCGTT TTCCAGCTCG AATCCGAAGG CATGCGGCGC ACGCTCGCCG CGGTGAAGCC CACCAACTTC GGCGACATCA TCGCTCTCGT CTCGCTTTAC CGTCCGGGCC CGATGGACAA CATACCATTG TTCGGCCGCC GCAAGAACGG GCTCGAAGCG ATCGAGTATC CGCACGACAA GCTGGCCGGG ATCCTGTCGG AAACCTACGG CATCTTCGTC TACCAGGAAC AGGTCATGCA GGCCGCGCAG ATCCTCGCCG GATACTCGCT CGGCGACGCC GACCTCCTGC GCCGCGCCAT GGGCAAGAAG GTCCAGGCGG AAATGGACGC GCAGCGCCAG CGTTTCGTCG ACGGCTGCAA GGAAGTGTCC GGCATTCCCG CCGCAAAGGC CAACGAACTT TTCGACTTGA TCGACAAGTT CGCGGGCTAC GGCTTCAACA AGTCGCACGC AGCCGCCTAC GCGCTGCTCG CCTACCAGAC CGCCTGGCTC AAGACGCACT ATCCGCACGA ATTCTATGCG GCGGCGATGT GCTTCGACAT GCACCAGTCG GAAAAGCTCT CGGTCTTCGT CGACGACATG CGCCGCAACG GCGTGGCGCT GGCCGGGCCC GATATCAACC ACTCCGAGGC CGAGTTCACC GTCGAGCGGA CGCACGACGG CTATGCCGTG CGATATGCCC TCGCGGGCCT GCGCAACGTG GGCGAGAAGG CGATGGAGCA GATCGTCGAG GAGCGTGAGG CCAACGGCCC GTTCGCTTCG CTGGACGATC TCTTCCGCCG CATCCCGGCG GGTTCGATGA ACCGTCGCCA GCTCGAAGCG CTCGCGGCCG GTGGCGCGCT CGATTGCCTC GAGCCCAACC GCGCGCAGAT CATCGCCAAT GCCGAACTGC TCATGGCCGT GGCCGAGGAG GCCTCGCGCT CGCGCACTTC GGGGCAGGGC GGCCTTTTCG GCGGCGACGA TCACGCCACT CCCGCCACCC GCCTCTCCGA CACGAAACCG TGGAGCCGCG CGGACCAGAT GGCGGCGGAG CGCGAGAATT TCGGCTTCTA CTTTGCCGCG CACCCGGTCG AGGAATACCG CGCCGTCGCC TCGGCCAATG GTGCGCGCTC CTATGGCAGC CTGATGACGA GCAGTGGGGA GCCGGGCGGA CGCTCGGGCG CCGTCATGGC CGCGCTGGTG GAGAACGTGC AGAAGCGCAA GACGAAGAAG GGCAAGGACT TCGTCATGGC CGACTTCTCC GACAGCTCCG GCCTGTTCTC GGCCTCCTGC TTCGAGGAAA GCCTGGTCGA ACCGTTCCAG CAGTGGGCGA GGGAGGGGAC CTGCGTTCTC CTCAACGTCG AACTCGACCG CCCCAATCCG GATGAGCCGC CGCGCGTCAC CGTGCGCGGC GCCCGCCCGC TCGCCTCGGT CACCAGCGCC TCGCGCATGG TGCTCAAGCT CGATGTCAGC AGGACGGAGG CGATTTCCGA CCTCGCGATG CTCTTGCCGC GGCGGCCTGA CGGCAAGGGC GAGGTGCTTG CCCGCCTTCG TACCGGCGGG CCAAAGGAGC CTCTCGTGCG CCTCGGCAAT GATTTCATAC TCGATAGTGA CTTAATCGAA CGATTGATTC CGATAGAAGG ACTCGCCAAT GTGGCATTGA CCGCAAGACC GGAAAGGCAT CTGCGGTTGG TCGAATAG
|
Protein sequence | MKKQAAPLAR ARPFSYLVHM PHAAFVPLRI FSSYTMLDGA IDPKAIAKTA AERGFPAAAI TDRNGLYGSV AYAKACKDMG VQPVIGTMLA VARPERDGAA TGFGPAAPTI DWLALYAQDA RGYDNLCHLV SRAHLDRPLE FAPHVVLADL VGHTDGLICL TAASEGALAQ LLAGGQQSAA EAYVDRLLEL FPERLYIEIA RRNDAVEEAS ETALIDLAYA RDLPLVATNP ACFAERTFYD AHDAMLCIAS STHVDSADRP RSSKEWWIKP AAVMEELFKD LPEALANTLV VAQRCAVMPP KRKPILPSLA GDQEGEARMC AEDSRKGLVL RLEPYYPEST HAELARVLCL GPDAEPVAAD HPQLVEAGVW EEVLDYRKRL EFEIAIINRM GFGGYFLIVA DFIKWAKEHG IPVGPGRGSG AGSLVAWALT ITDLDPIKLG LLFERFLNPE RVSMPDFDID FCETRRGEVI RYVQRKYGED HVAQIITFGK LKARAVLRDT GRILQMSYGQ TDRLCKMVPN HPTDPWPLPR ALNGVAELKR EYDRDPEVKR LIDLAMQLEG LPRNSSTHAA GVVIGDRPLA QLVPLYRDPR SDMPVTQFDM KHVEDAGLVK FDFLGLKTLS VLRKAVDLMK KRSIEIDLSA LPWDDEQTYK LLQSGDTVGV FQLESEGMRR TLAAVKPTNF GDIIALVSLY RPGPMDNIPL FGRRKNGLEA IEYPHDKLAG ILSETYGIFV YQEQVMQAAQ ILAGYSLGDA DLLRRAMGKK VQAEMDAQRQ RFVDGCKEVS GIPAAKANEL FDLIDKFAGY GFNKSHAAAY ALLAYQTAWL KTHYPHEFYA AAMCFDMHQS EKLSVFVDDM RRNGVALAGP DINHSEAEFT VERTHDGYAV RYALAGLRNV GEKAMEQIVE EREANGPFAS LDDLFRRIPA GSMNRRQLEA LAAGGALDCL EPNRAQIIAN AELLMAVAEE ASRSRTSGQG GLFGGDDHAT PATRLSDTKP WSRADQMAAE RENFGFYFAA HPVEEYRAVA SANGARSYGS LMTSSGEPGG RSGAVMAALV ENVQKRKTKK GKDFVMADFS DSSGLFSASC FEESLVEPFQ QWAREGTCVL LNVELDRPNP DEPPRVTVRG ARPLASVTSA SRMVLKLDVS RTEAISDLAM LLPRRPDGKG EVLARLRTGG PKEPLVRLGN DFILDSDLIE RLIPIEGLAN VALTARPERH LRLVE
|
| |