Gene Information |
Locus tag | Sala_0625 |
Symbol | |
ID | 4082561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | - |
Start bp | 635390 |
End bp | 638203 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638008984 |
Product | DNA polymerase I |
Protein accession | YP_615679 |
Protein GI | 103486118 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
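The length fields in the table above are consistent with the listed coordinates. The short sketch below shows that arithmetic; it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record for Sala_0625.
gene_bp, protein_aa = cds_lengths(635390, 638203)
print(gene_bp, protein_aa)
```

With the values from this record the result matches the Gene Length (2814 bp) and Protein Length (937 aa) fields.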
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.480617 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGA AGAATCACCT CTATCTGGTC GATGGCTCCA GCTATATCTT TCGCGCCTAT CACCGCCTGC CGCCGCTGAC CAACCCCAGG GGCGTGCCGG TCGGTGCGGT TTACGGCTAC ACCACGATGC TGTGGAAGCT CGCGAAGGAT CTGCACGACG CGGACGGGCC GACGCACCTT GCGGTGATCC TCGACCATTC GAGCGAGTCG TTCCGCAACG AGATTTACGA CCAGTATAAG GCGAACCGCC CCGACCCGCC CGAGGATCTG GTCCCGCAAT TCCCGCTGAT CCGCGACGCG ACGCGCGCTT TCTCCCTGCC GTGCATCGAG ATGGAGGGGT TCGAGGCCGA CGATCTGATT GCGAGCTATA CCGAAGCTGC GGTGCGCGAA GGGTGGGACG TCACCATCGT GTCGTCGGAC AAGGATCTGA TGCAATTGAT CCGCGAGCCC GCAGGCGGCC CGCATGTCGA CATGCTCGAC ACGATGAAGA ATGTCCGGCT GGGGATCGAC GCGGTGAACG AGAAGTTCGG CGTCACCCCC GATCTGGTCG GCGACGTGCT CGCGCTGATG GGCGACAGCG TCGACAATGT TCCCGGCGTG CGCGGCGTGG GGCCGAAGAC CGCGACGAAG CTGATCCAGG AATATGGCAG CCTGACCGCC GCGCTCGACG GCGCCGAAAC GATGAAGCCC GGCAAGCTGC GCGAGAATCT GATCGAACAT CGTGCGATGG CGGAGCTTTC GCGCATCCTG GTCGACCTCA AGCGCGATTG TCCGCTGCCG GACCCGCTCG ACGCGCTCAA GCTTGGCGCG ATCCCGCCCG AACCGCTCAA GCTGTTCCTC GACGAACATG GTTTCCGTTC GCTGTCGGCG AAGCTCGATC TCGGCACCGC GCCCGCGGGG CCGCCGACGC TGCCGCGTGC GGGGGCAGCG CCCGTGACGC CCGCTGCCGA TGCGCCTTCG ACCCCGACAT TGCCGTCGAT GCCGCCGATC GACCGCGCGC GCTATGAAAC GGTGACGACG ATCGAGGCGC TCGACCGCTG GATCGCCGAC GCGCGCGCGG CGCATGTCGT CGCGGTCGAC ACCGAGACCG CGAGCCTGGA CAGCGTTACC GGGCGGCTCG TCGGGGTGAG CCTGTCGACC GGGGCGGGCA AGGCCTGTTA CATTCCGCTC GGTCACGGCG GCACCGACAT GTTCGCCGAA AAGCCCGAAC AGATCGCGAT GGGCGACGCG CTGGAGCGCC TCGGCGCGCT GTTTGCCGAC GATGCGGTGC TCAAGGTCGG GCACAACCTC AAATATGACA TTGGCGTGCT CGCGCAGCAC GGGGTCACCG TCGCGCCCTA TGACGACACG CTGCTGATGA GCTTTGCGCT CGACGCGGGC AAGCACCAGC ACGGGCTCGA CGAGCTTGCC AAGCTGCACC TCGACCATGT CTGCCTGTCG TTCAAGGACG TGTGCGGCAC TGGCAAGTCG CAGATCAGCT TCGCCGAAGT GCAACTCGAC CGCGCGACCG AATATGCCGC CGAAGATGCC GAGGTCGCGT GGCGGCTGTG GAAGCTGCTC AAGCTCCGCC TGCCGCTCGA AGGCGGGACG CGCGTCTACG AGATGGTCGA CAGGCCGCTG GCCGCGGTCG TCGAGGGCAT GGAACGCGCC GGCATCATGG TCGACCGCGA CTATCTGGCC AAGCTGTCGG GCGAGTTTGC GAACGAGATG CTGCGCATCG AGGGCGAAAT CCACGCCCTC GCAGGTCAGC CCTTTGCGAT CGGCAGCCCC AGGCAGCTCG GCGAAATCCT GTTCGACAAG 
ATGGGCCTCA AGGGCGGGCG CAAGGGCAAG TCGGGCGACT GGTCGACCGA CCAGAATGAG CTGGAACGGC TCGAACGCGA CGGCGTACCG ATTGCGCGCA AAATCCTCGA ATGGCGCCAG CTCGCCAAGC TGAAATCGAC CTATACCGAC GCCTTGCAGG AACAGGTGAA CGCCACGACC GGGCGCGTCC ACACCAGCTA CAGTCTCGTC GGCGCGCAGA CGGGGCGACT GTCATCGACC GATCCGAACC TTCAGAATAT CCCGATCCGC ACCGAAGTCG GGCGGCAGAT CCGCGACGCC TTCATCGCCG CGCCGGGCCA TGTGCTGATT GCCGCCGACT ATAGCCAGAT CGAATTGCGG CTCGCGGCGC ATATGGCCGA TGTCCCCGAG CTGAAAGAGG CCTTCGCCCG CGGCGACGAC ATTCACGCCG CGACCGCGAT CGAGCTGTTC GGCGAGGTCA ACCGCGACAC GCGCGGCAAG GCGAAGACGG TCAATTTCTC GATCCTCTAT GGCATTTCGC GCTGGGGTCT CGCCGGACGG CTCGAAATCA CCCCCGACGA GGCGCAGGCG CTCATCAGCC GCTATTTCGA GCGCTTCCCC GGCATCTCGG ACTATATCAG CGACACGCTC GAAACCGCGC GCGCGCGCGG CTATACCGAG ACCTTGTTCG GCCGGAAGAC CTGGTTCCCG CGCATCAAGG CGGCGAACCA GAACGAGCGC GCGGGAAGCG AGCGCGCCGC GATCAACGCG CCGATCCAGG GCACGAGCGC CGACCTGATC AAGCGCGCGA TGGCGCGGAT GCCGGGCGCG CTTGCGGATG CGGGCCTCGC GGATGTCAAG ATGCTGCTTC AGGTCCATGA CGAACTGGTG TTCGAGGCGC CCGAGGACAA GGCCGCAGCG GCGGGCGAGG TGATCCGCGC GGTGATGATG GGCGCCGCCG AGCCGGCGCT CAAACTCTCG GTCCCGCTGG AGGTCGAGAT CGGCACAGGT AAGAGCTGGG GCGACGCGCA TTGA
|
Protein sequence | MSEKNHLYLV DGSSYIFRAY HRLPPLTNPR GVPVGAVYGY TTMLWKLAKD LHDADGPTHL AVILDHSSES FRNEIYDQYK ANRPDPPEDL VPQFPLIRDA TRAFSLPCIE MEGFEADDLI ASYTEAAVRE GWDVTIVSSD KDLMQLIREP AGGPHVDMLD TMKNVRLGID AVNEKFGVTP DLVGDVLALM GDSVDNVPGV RGVGPKTATK LIQEYGSLTA ALDGAETMKP GKLRENLIEH RAMAELSRIL VDLKRDCPLP DPLDALKLGA IPPEPLKLFL DEHGFRSLSA KLDLGTAPAG PPTLPRAGAA PVTPAADAPS TPTLPSMPPI DRARYETVTT IEALDRWIAD ARAAHVVAVD TETASLDSVT GRLVGVSLST GAGKACYIPL GHGGTDMFAE KPEQIAMGDA LERLGALFAD DAVLKVGHNL KYDIGVLAQH GVTVAPYDDT LLMSFALDAG KHQHGLDELA KLHLDHVCLS FKDVCGTGKS QISFAEVQLD RATEYAAEDA EVAWRLWKLL KLRLPLEGGT RVYEMVDRPL AAVVEGMERA GIMVDRDYLA KLSGEFANEM LRIEGEIHAL AGQPFAIGSP RQLGEILFDK MGLKGGRKGK SGDWSTDQNE LERLERDGVP IARKILEWRQ LAKLKSTYTD ALQEQVNATT GRVHTSYSLV GAQTGRLSST DPNLQNIPIR TEVGRQIRDA FIAAPGHVLI AADYSQIELR LAAHMADVPE LKEAFARGDD IHAATAIELF GEVNRDTRGK AKTVNFSILY GISRWGLAGR LEITPDEAQA LISRYFERFP GISDYISDTL ETARARGYTE TLFGRKTWFP RIKAANQNER AGSERAAINA PIQGTSADLI KRAMARMPGA LADAGLADVK MLLQVHDELV FEAPEDKAAA AGEVIRAVMM GAAEPALKLS VPLEVEIGTG KSWGDAH
|
| |
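The gene and protein sequences above can be cross-checked by translating codons. The sketch below is a minimal illustration rather than a reference implementation: it builds a codon table from the standard amino acid assignments (translation table 11 shares these with the standard code and differs only in permitted start codons), and it assumes the gene sequence is listed in coding orientation, which is consistent with it starting with ATG and ending with TGA. Only the first 30 and last 24 bases are copied here to keep the example short; the names `translate`, `first_30` and `last_24` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence, copied from the record.
first_30 = "ATGTCCGAGA AGAATCACCT CTATCTGGTC"
last_24 = "AAGAGCTGGG GCGACGCGCA TTGA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_24))   # compare with the end of the protein sequence
```

The two fragments translate to MSEKNHLYLV and KSWGDAH, which agree with the first ten and last seven residues of the listed protein sequence.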