Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1538 |
Symbol | |
ID | 3917213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1584662 |
End bp | 1587898 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444279 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_496813 |
Protein GI | 87199556 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGGT CCCAGCCCTT TGCCGAACTG GTCTCTGCCA CCAATTACAG CTTCCTGCGT GGGGCTTCGC ATCCCGCGCA GATGGTCGCG CGGGCACACC AGCTTGGCAT GGCAGGATTG GGCGTGGCCG ATCGCAATAC CGTTGCCGGA GTGGTGCGCG CGCACGTGGG CTGGCGCGAA GTAGGGGGAC TTGCCAGTGG CTTCCGATTG GCGGTCGGAG CCAGGCTGGT CTTTGCCGAC GGAACGCCGG ATGTGGTGGC CTATCCCTCC ACGCTTCGTG GCTGGCGCCG GCTGACGCGG CTGTTGACGC TTGGCAACAG GCGGGCGCAA AAAGGCGATT GCACGCTCTA CCTGTCGGAT CTCCTGGACC ATGCCGACGA TCTTTTGCTG ATCGCCACCG GTCACGATCG GCGGGTGCTG GAAAAACTGA GGGAAGCACG TCCCGGCGCG GTGTGGCTGG CCGTTACGAT GCAGCGTGGC GGCACCGACG CACGGCGGTT GGCCCAGGCA CAGGCTCTGG GCGCGGCGTG TGGCGTGCCC CTTCTGGCCA CCTGCGATGC GCTCTATGAC CATCCCGATG CGCGGCCGCT GCACGATGTT CTGACGTGCA TTCGCGAAGG CGTGACCATT GCCGAGGCAG GGCAACGTCT GCTCGCCAAT GCGGAACGCC ATCTCAAACT CCCTGCGGAA ATGGCTCGCC TTTTTCGGAG TCATCCCGAA GCCGTGACGG CCAGCGTCGA GATATTGTCG CGTATCACCT TCACGCTCGA CGACCTGCGC TACGAATATC CGCACGAGCC AGTACCCGAG GGCTGGCAAC CGCAGGCGTG GCTGGAACAT CTGGTGCAAC AAGGGGGGGA GGCGCTGTTC CCCGATGGCT TGCCGGAGAA CTACCAGGCC GTGCTGAGGC AAGAGTTTTG CCTGATCCGG AAGAAGAACT ACGCTTACTA TTTCCTCACC GTGCACGATA TCGTGCGCCA CGCTCGCAGC CTCGATCCGC CGATCCTGTG CCAGGGCAGA GGCAGTGCGG CCAATTCGCT GGTCTGCTAT TTCCTGGGTA TCACCCCGAT CGACCCGGTG CGCGAACAGC TGCTGTTCTC GCGTTTCCTG TCCGAAATGC GCGACGAGCC GCCCGATATC GACGTGGACT TCGAGCATGA GCGCCGCGAG GAGATCATGC AGTACATCTA TGCTCGCTAT GGCCGCGAAC GGGCGGGCAT CGCCGCCACC GTGATCCACT ATCGTCCACG CAGTGCCATC CGCGAAGTGG GCAAGGTGCT GGGGCTGACC GAGGACGTGA CGGCGCGGCT CGCGTCTACC ATCTGGGGCA ATTGGGGCAA GGATGTGCCG GAAGCCTACG TGGCGGAGGC CGGGCTGGAT ATTGCCAACC CCGTGGTCGC ACGGCTGAAG GCGCTGGTCG ATCAGTTGCT GACCTTTCCC CGCCATCTCT CGCAGCATGT CGGCGGCTTC GTACTGACCG AGTCACGGCT CGACGAACTG GTACCGATCC ACAATGCCGC GATGCCTGAC CGCACGTTCA TCGAATGGGA CAAGGACGAT ATCGACGCGC TGGGGCTGAT GAAGGTCGAC ATTCTGGCGC TGGGAATGCT CACCGCCATC CGCAAGAGCT TCGACCTGCT GCGCAGGCAC GACGTGGCAG ACCTGACGCT GGCCTCCGTC CCGCACGACG ACAGGGCGAC GTATGCTATG CTCAAGCGGG GTGACAGCAT CGGCACCTTC CAGGTCGAGA GCCGCGCGCA GATTGCCATG CTGCCCCGCA TGAAGCCCGA CTGTCTCTAC GATCTGGTCA TCCAGGTCGC GATTGTCCGC CCCGGGCCGA TTCAGGGCGG CATGGTCCAT CCCTACCTGC GGCGGCGCAA CGGCGAGGAA GCGGCGGTCT TTCCCAGTCC CGCGCCGCCT CACGATCCGC ATGAGTTGCA CAAGGTACTG GGCAAGACGC TGGGTGTGCC GCTGTTTCAG GAACAGGCGA TGAAGCTGGC CATCGTCGCG GCCCGTTTCA CCCCCGCACA GGCCGATGGC CTCAGGCGCG CAATGGCCAC GTTCCGTCAC CATGGGACGG TCCACAACTA CGAAGACCTG CTGATCGATG GCATGGTGGC ACGCGGCTAT GAACGTGAAT TCGCCGAACG CTGCTATGAA CAGATCAAGG GGTTCGGCGA ATATGGCTTT CCCGAAAGCC ACGCGCAGGC TTTCGGGTGG CTGGCCTATG TCTCGTCATG GCTCAAATGC CACCACCCGG CCGCTTTCAC CTGTGCACTG CTCAACAGTC AGCCGATGGG CTTCTATGCC CCGGCGCAAC TCGTTCGCGA TGCGCAACAA CATGGCGTGG AAGTCCGGGC GGTGGATGTG AACGCCAGTC ATTGGGATTG TACGCTGGCA GGGGGAAAGG GTGGCCCATT TGCGCTGCAG CTTGGCTTTT CCCGTATCGA CGGTTTCCGC AGGGCTTGGG CCGAAGCGTT GGCGCAGGCC CGCCACGATG GCATGTTTAC CTCGATCGAG GACGTTGCCC GCCGTGCCGA GATGCCCCCT CAGGCCTTGC GCAAGCTAGC CGATGCCGAT GCGTTCGGAT CTCTGGCATC CTCGCGCCGT GATGCGCTGT GGGATGTCCG CCGCACGCCC CCCACGCAAC TACCCTTGTT CGCCTTTGCC AATGCTCCGG AGCTCGGGCA GGAACCCGAT GCCCGCCTTT CCGCGATGCC GCTTCCCGAA GAAGTGGTGG CCGATTACCA GACCACACGG CTGTCGCTGA AGGGTCACCC CATGCAATTC CTGCGGGATG CGCTGGCCCG CAAAGGCGTG CTGTCCTGCG CTGAAGCCAA TACCGCTGCG GATGGTCGCA AGGTCCGCGT CGCCGGTGTT GTTCTTACCC GCCAGCGTCC TGGCAAGGGC AATGCGGTGT TCATCACGAT TGAAGATGAA ACCGGCGTGG TCAACGCACT GCTCTGGGCG CGCGATCTCG ACAAGCAGCG CCGCGCCGTC ATGGCCGCTC GCTTGATGCT GATCGAAGGC GAAATTCAGA AGAGCAAGGA AGGCGTGGTC CACCTTATGG CCGCGCGGAT CATGGATCGT ACCGACCTTC TCGACCGCCT GACAGATCGG GAGCAGGTGA GGCCCGCTGT TTCACGCGGC GACGAGGTCC ATCGCCCCCA ATACCCCCGC CTGCATGCGC ATCCACGCGA TGTAAGGGTC CTGCCCAAAT CCCGGGATTT TCGCTGA
|
Protein sequence | MTGSQPFAEL VSATNYSFLR GASHPAQMVA RAHQLGMAGL GVADRNTVAG VVRAHVGWRE VGGLASGFRL AVGARLVFAD GTPDVVAYPS TLRGWRRLTR LLTLGNRRAQ KGDCTLYLSD LLDHADDLLL IATGHDRRVL EKLREARPGA VWLAVTMQRG GTDARRLAQA QALGAACGVP LLATCDALYD HPDARPLHDV LTCIREGVTI AEAGQRLLAN AERHLKLPAE MARLFRSHPE AVTASVEILS RITFTLDDLR YEYPHEPVPE GWQPQAWLEH LVQQGGEALF PDGLPENYQA VLRQEFCLIR KKNYAYYFLT VHDIVRHARS LDPPILCQGR GSAANSLVCY FLGITPIDPV REQLLFSRFL SEMRDEPPDI DVDFEHERRE EIMQYIYARY GRERAGIAAT VIHYRPRSAI REVGKVLGLT EDVTARLAST IWGNWGKDVP EAYVAEAGLD IANPVVARLK ALVDQLLTFP RHLSQHVGGF VLTESRLDEL VPIHNAAMPD RTFIEWDKDD IDALGLMKVD ILALGMLTAI RKSFDLLRRH DVADLTLASV PHDDRATYAM LKRGDSIGTF QVESRAQIAM LPRMKPDCLY DLVIQVAIVR PGPIQGGMVH PYLRRRNGEE AAVFPSPAPP HDPHELHKVL GKTLGVPLFQ EQAMKLAIVA ARFTPAQADG LRRAMATFRH HGTVHNYEDL LIDGMVARGY EREFAERCYE QIKGFGEYGF PESHAQAFGW LAYVSSWLKC HHPAAFTCAL LNSQPMGFYA PAQLVRDAQQ HGVEVRAVDV NASHWDCTLA GGKGGPFALQ LGFSRIDGFR RAWAEALAQA RHDGMFTSIE DVARRAEMPP QALRKLADAD AFGSLASSRR DALWDVRRTP PTQLPLFAFA NAPELGQEPD ARLSAMPLPE EVVADYQTTR LSLKGHPMQF LRDALARKGV LSCAEANTAA DGRKVRVAGV VLTRQRPGKG NAVFITIEDE TGVVNALLWA RDLDKQRRAV MAARLMLIEG EIQKSKEGVV HLMAARIMDR TDLLDRLTDR EQVRPAVSRG DEVHRPQYPR LHAHPRDVRV LPKSRDFR
|
| |