Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_0450 |
Symbol | |
ID | 5196829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 477314 |
End bp | 480139 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640579989 |
Product | DNA polymerase I |
Protein accession | YP_001260956 |
Protein GI | 148553374 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCCCC CTGCCCCGAC GCCTGCTATC CACCCCGGCA TGTCATCCAA GCATCTCTAT CTGGTCGACG GCTCCGGCTA TATTTTCCGG GCCTATCATC GCCTGCCGCC GCTCACCAAC CGCCACGGAA TCCCGGCCGG GGCCGTCTAC GGCTTCACGA CGATGCTGTG GAAGCTGATC AACGAGCTGC ACCAGGCCGA CGGACCGACT CACCTGGCGG TGATTCTCGA CGCCTCGTCG AAGACCTTCC GCAACGAGAT GTACGACCAG TACAAGGCGC ACCGGCCGCC GCCGCCCGAA GACCTGGTGC CGCAATTCCC GCTGATCCGC GACGCGGTGC GCGCCTTCTC GGTCCCCTGC ATCGAGGAAT TGGGGCTCGA GGCCGACGAC ATCATCGCCT GTTATGCCGA AGCCGCGCTG GCGCAGGACT GGCAGGTCAC GATCGTCTCG TCCGACAAGG ACCTGATGCA GCTCATCCGG CCCGGCCTCG ACCTGCTCGA CACGATGAAC AACCGCCGCC TCGGCCGCGA GCATGTGCTG GAGAAATTCG GCGTCGAGCC GGAGGCGCTG GGCGACGTGC TCGCGCTGAT GGGCGACAGC GTCGACAATG TCCCCGGCGT CCCCGGCATC GGACCCAAGA CCGCGAGCCA GCTCATCCAG CAGTTCGGCA CCCTCGACGC GGTGCTGGCC AGCACCGACC AGATCACCAA GCCCAAGCTC AAGCAGTCGC TGATCGACCA TGCGGACAAC GCCCGGCTGT CGCGCGAGCT GGTCCGCCTG AAATGCGATG CGCCGCTGCC CGAGCCGCTC GACGGGCTGG AATTGAAGGG CCTGCCGCCC GAGCCGCTGC GCGCCTTCCT GGAGGACCAG GGCTTCAAGT CGCTGCTCGC CAAGGTGCAG GGCGACGGCC CCGCCACCCT GCCGACCGGC GGCCCGACCA TGCTGGCGCT CGCGCCCGGC GCCGAAGAGA AGCCGGCGCC CGTGATCGCG CAGGTGCCGT TCAACCATAA GGATTACGAG ACCGTCGTCG ACGAGGCGGC GCTCGACCGC TGGATCGCCG ACGGCTTCGC CTGCGGCCGG ATCGCGGTCG ACACCGAGAC CGACAATGTC GATCCGGTCC GCGCCAACCT GGTCGGGGTC AGCCTGTCCA CCGCGCCGGG CAAGGCCTGC TACATCCCGC TCGCCCATAT CGGCGACGGG CTGCTGTCGG AGACGCCCAG GCAGATCGCG ATGGACGCGG CGCTGGCGAA GCTCAAGCCC CTGCTCGAAG CCGCGCACAT CCTCAAGATC GGCCAGAACA TCAAATATGA CATGGTCGTG CTGGCCAAAT ACGGCGTCGA CGTCGCGCCC TATGACGACA CGATGCTGCT GTCCTACGAT CTCGACGCCG GGCTCGGCGG CCACGGCATG GACGAATTGG CGCAGCGCCA CCTCGACCAT GGCTGCATCG AGTTCAAGAC GGTGTGCGGC ACCGGCAAGA GCCAGATCAG CTTCGACAAG GTGACGCTCG ACGTCGCCAC CGAATATGCC GCCGAGGACG CCGACGTCAC GCTGCGCCTG TGGCAGCTGC TGAAGGGGCG GCTCGCGCCC GAGGCGGCGA CCCGCGTCTA TGAGCTGATC GACCGGCCGC TGGTGCCGGT GCTGACGGGC ATGGAGCGCG CCGGCATCAA GGTCGACCGC GAGGAACTGG CCCGGCTGTC GGCCGAGTTC TCGGGCGAGG CGACCCGGCT GGAGGCCGCG ATCCACGCCG AGGCGGGGAC GCCGTTCACC GTCGGCAGCC CCAAGCAGCT CGGCGACATA TTGTTCGACA AGATGGGCCT GAAGGGCGGC CGCAAGGGCA AGACCGGGGT CTATTCGACC GACGTCACCG AACTGGAGCG GCTCGCCGGC GACGGCGTGG CGATCGCGCG GCTGGTGCTC GACTGGCGTC AGCTCACCAA GCTGCGCTCG ACCTATACCG ACGCGCTCCA GCAGCAGATC AATCCGGCGA CGGGCCGGGT CCACACCTGC TTCTCGATGG CGGTCGCGCA GACCGGCCGG CTGTCCTCGA CCGATCCCAA CCTCCAGAAC ATCCCGATCC GCAGCGACCA TGGCCGCCGT ATCCGCGACG CCTTCGTGGC CGAGCCGGGC AAGCTCATCC TGTCGGCCGA CTATTCGCAG ATCGAGCTGC GGCTGGCGGC GCACATGGCC GACGTGCCGG CGCTGAAGGA CGCCTTCGCG CGCGGCGACG ACATCCACGC GCTGACCGCG CAGGAGGTGT TCGGCGAGGT CAACCGCGAC ACCCGCGCCC GCGCCAAGAC GATCAACTTC TCGATCCTCT ACGGCATCTC CGCCTGGGGC CTCGCCGGGC GCCTGGAGGT GTCGCGCGAG GAAGCGCAGG GGATGATCGA CCGCTATTTC TCCCGCTTCC CCGGCATCAA CCACTATATC GCCGCGACCC TCGGCCAGGT CCGCGACCAG GGCTATGTGT CGACGCTGTT CGGCCGCAAG ACCCATCTGC CGTGGATCAG GAGCGCCAAG CAGGGCGAGC GCCAGTCCGC CGAGCGCCAG GCGATCAACG CCCCGATCCA GGGCACCTCC GCCGACATCA TCAAGCGCGC GATGGTGCGG ATGAACCCCG CGCTCGCCGC CGCCGGGCTG GGCGACGTGA AGATGCTGCT CCAGGTCCAT GACGAACTGG TGTTCGAGCT GGCGCCCGAG CAGGTCGAGC CGGCATCGGC GGTGATCCGC GCGGTGATGG CGGGCGCGGC CGAGCCGATC GTCCGGCTGT CGGTGCCGCT GGGCGTCGAG ATCGGCACGG GGCCGAGCTG GGGCGCCGCG CATTGA
|
Protein sequence | MFPPAPTPAI HPGMSSKHLY LVDGSGYIFR AYHRLPPLTN RHGIPAGAVY GFTTMLWKLI NELHQADGPT HLAVILDASS KTFRNEMYDQ YKAHRPPPPE DLVPQFPLIR DAVRAFSVPC IEELGLEADD IIACYAEAAL AQDWQVTIVS SDKDLMQLIR PGLDLLDTMN NRRLGREHVL EKFGVEPEAL GDVLALMGDS VDNVPGVPGI GPKTASQLIQ QFGTLDAVLA STDQITKPKL KQSLIDHADN ARLSRELVRL KCDAPLPEPL DGLELKGLPP EPLRAFLEDQ GFKSLLAKVQ GDGPATLPTG GPTMLALAPG AEEKPAPVIA QVPFNHKDYE TVVDEAALDR WIADGFACGR IAVDTETDNV DPVRANLVGV SLSTAPGKAC YIPLAHIGDG LLSETPRQIA MDAALAKLKP LLEAAHILKI GQNIKYDMVV LAKYGVDVAP YDDTMLLSYD LDAGLGGHGM DELAQRHLDH GCIEFKTVCG TGKSQISFDK VTLDVATEYA AEDADVTLRL WQLLKGRLAP EAATRVYELI DRPLVPVLTG MERAGIKVDR EELARLSAEF SGEATRLEAA IHAEAGTPFT VGSPKQLGDI LFDKMGLKGG RKGKTGVYST DVTELERLAG DGVAIARLVL DWRQLTKLRS TYTDALQQQI NPATGRVHTC FSMAVAQTGR LSSTDPNLQN IPIRSDHGRR IRDAFVAEPG KLILSADYSQ IELRLAAHMA DVPALKDAFA RGDDIHALTA QEVFGEVNRD TRARAKTINF SILYGISAWG LAGRLEVSRE EAQGMIDRYF SRFPGINHYI AATLGQVRDQ GYVSTLFGRK THLPWIRSAK QGERQSAERQ AINAPIQGTS ADIIKRAMVR MNPALAAAGL GDVKMLLQVH DELVFELAPE QVEPASAVIR AVMAGAAEPI VRLSVPLGVE IGTGPSWGAA H
|
| |