Gene Sala_0625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0625 
Symbol 
ID4082561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp635390 
End bp638203 
Gene Length2814 bp 
Protein Length937 aa 
Translation table11 
GC content66% 
IMG OID638008984 
ProductDNA polymerase I 
Protein accessionYP_615679 
Protein GI103486118 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.480617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA AGAATCACCT CTATCTGGTC GATGGCTCCA GCTATATCTT TCGCGCCTAT 
CACCGCCTGC CGCCGCTGAC CAACCCCAGG GGCGTGCCGG TCGGTGCGGT TTACGGCTAC
ACCACGATGC TGTGGAAGCT CGCGAAGGAT CTGCACGACG CGGACGGGCC GACGCACCTT
GCGGTGATCC TCGACCATTC GAGCGAGTCG TTCCGCAACG AGATTTACGA CCAGTATAAG
GCGAACCGCC CCGACCCGCC CGAGGATCTG GTCCCGCAAT TCCCGCTGAT CCGCGACGCG
ACGCGCGCTT TCTCCCTGCC GTGCATCGAG ATGGAGGGGT TCGAGGCCGA CGATCTGATT
GCGAGCTATA CCGAAGCTGC GGTGCGCGAA GGGTGGGACG TCACCATCGT GTCGTCGGAC
AAGGATCTGA TGCAATTGAT CCGCGAGCCC GCAGGCGGCC CGCATGTCGA CATGCTCGAC
ACGATGAAGA ATGTCCGGCT GGGGATCGAC GCGGTGAACG AGAAGTTCGG CGTCACCCCC
GATCTGGTCG GCGACGTGCT CGCGCTGATG GGCGACAGCG TCGACAATGT TCCCGGCGTG
CGCGGCGTGG GGCCGAAGAC CGCGACGAAG CTGATCCAGG AATATGGCAG CCTGACCGCC
GCGCTCGACG GCGCCGAAAC GATGAAGCCC GGCAAGCTGC GCGAGAATCT GATCGAACAT
CGTGCGATGG CGGAGCTTTC GCGCATCCTG GTCGACCTCA AGCGCGATTG TCCGCTGCCG
GACCCGCTCG ACGCGCTCAA GCTTGGCGCG ATCCCGCCCG AACCGCTCAA GCTGTTCCTC
GACGAACATG GTTTCCGTTC GCTGTCGGCG AAGCTCGATC TCGGCACCGC GCCCGCGGGG
CCGCCGACGC TGCCGCGTGC GGGGGCAGCG CCCGTGACGC CCGCTGCCGA TGCGCCTTCG
ACCCCGACAT TGCCGTCGAT GCCGCCGATC GACCGCGCGC GCTATGAAAC GGTGACGACG
ATCGAGGCGC TCGACCGCTG GATCGCCGAC GCGCGCGCGG CGCATGTCGT CGCGGTCGAC
ACCGAGACCG CGAGCCTGGA CAGCGTTACC GGGCGGCTCG TCGGGGTGAG CCTGTCGACC
GGGGCGGGCA AGGCCTGTTA CATTCCGCTC GGTCACGGCG GCACCGACAT GTTCGCCGAA
AAGCCCGAAC AGATCGCGAT GGGCGACGCG CTGGAGCGCC TCGGCGCGCT GTTTGCCGAC
GATGCGGTGC TCAAGGTCGG GCACAACCTC AAATATGACA TTGGCGTGCT CGCGCAGCAC
GGGGTCACCG TCGCGCCCTA TGACGACACG CTGCTGATGA GCTTTGCGCT CGACGCGGGC
AAGCACCAGC ACGGGCTCGA CGAGCTTGCC AAGCTGCACC TCGACCATGT CTGCCTGTCG
TTCAAGGACG TGTGCGGCAC TGGCAAGTCG CAGATCAGCT TCGCCGAAGT GCAACTCGAC
CGCGCGACCG AATATGCCGC CGAAGATGCC GAGGTCGCGT GGCGGCTGTG GAAGCTGCTC
AAGCTCCGCC TGCCGCTCGA AGGCGGGACG CGCGTCTACG AGATGGTCGA CAGGCCGCTG
GCCGCGGTCG TCGAGGGCAT GGAACGCGCC GGCATCATGG TCGACCGCGA CTATCTGGCC
AAGCTGTCGG GCGAGTTTGC GAACGAGATG CTGCGCATCG AGGGCGAAAT CCACGCCCTC
GCAGGTCAGC CCTTTGCGAT CGGCAGCCCC AGGCAGCTCG GCGAAATCCT GTTCGACAAG
ATGGGCCTCA AGGGCGGGCG CAAGGGCAAG TCGGGCGACT GGTCGACCGA CCAGAATGAG
CTGGAACGGC TCGAACGCGA CGGCGTACCG ATTGCGCGCA AAATCCTCGA ATGGCGCCAG
CTCGCCAAGC TGAAATCGAC CTATACCGAC GCCTTGCAGG AACAGGTGAA CGCCACGACC
GGGCGCGTCC ACACCAGCTA CAGTCTCGTC GGCGCGCAGA CGGGGCGACT GTCATCGACC
GATCCGAACC TTCAGAATAT CCCGATCCGC ACCGAAGTCG GGCGGCAGAT CCGCGACGCC
TTCATCGCCG CGCCGGGCCA TGTGCTGATT GCCGCCGACT ATAGCCAGAT CGAATTGCGG
CTCGCGGCGC ATATGGCCGA TGTCCCCGAG CTGAAAGAGG CCTTCGCCCG CGGCGACGAC
ATTCACGCCG CGACCGCGAT CGAGCTGTTC GGCGAGGTCA ACCGCGACAC GCGCGGCAAG
GCGAAGACGG TCAATTTCTC GATCCTCTAT GGCATTTCGC GCTGGGGTCT CGCCGGACGG
CTCGAAATCA CCCCCGACGA GGCGCAGGCG CTCATCAGCC GCTATTTCGA GCGCTTCCCC
GGCATCTCGG ACTATATCAG CGACACGCTC GAAACCGCGC GCGCGCGCGG CTATACCGAG
ACCTTGTTCG GCCGGAAGAC CTGGTTCCCG CGCATCAAGG CGGCGAACCA GAACGAGCGC
GCGGGAAGCG AGCGCGCCGC GATCAACGCG CCGATCCAGG GCACGAGCGC CGACCTGATC
AAGCGCGCGA TGGCGCGGAT GCCGGGCGCG CTTGCGGATG CGGGCCTCGC GGATGTCAAG
ATGCTGCTTC AGGTCCATGA CGAACTGGTG TTCGAGGCGC CCGAGGACAA GGCCGCAGCG
GCGGGCGAGG TGATCCGCGC GGTGATGATG GGCGCCGCCG AGCCGGCGCT CAAACTCTCG
GTCCCGCTGG AGGTCGAGAT CGGCACAGGT AAGAGCTGGG GCGACGCGCA TTGA
 
Protein sequence
MSEKNHLYLV DGSSYIFRAY HRLPPLTNPR GVPVGAVYGY TTMLWKLAKD LHDADGPTHL 
AVILDHSSES FRNEIYDQYK ANRPDPPEDL VPQFPLIRDA TRAFSLPCIE MEGFEADDLI
ASYTEAAVRE GWDVTIVSSD KDLMQLIREP AGGPHVDMLD TMKNVRLGID AVNEKFGVTP
DLVGDVLALM GDSVDNVPGV RGVGPKTATK LIQEYGSLTA ALDGAETMKP GKLRENLIEH
RAMAELSRIL VDLKRDCPLP DPLDALKLGA IPPEPLKLFL DEHGFRSLSA KLDLGTAPAG
PPTLPRAGAA PVTPAADAPS TPTLPSMPPI DRARYETVTT IEALDRWIAD ARAAHVVAVD
TETASLDSVT GRLVGVSLST GAGKACYIPL GHGGTDMFAE KPEQIAMGDA LERLGALFAD
DAVLKVGHNL KYDIGVLAQH GVTVAPYDDT LLMSFALDAG KHQHGLDELA KLHLDHVCLS
FKDVCGTGKS QISFAEVQLD RATEYAAEDA EVAWRLWKLL KLRLPLEGGT RVYEMVDRPL
AAVVEGMERA GIMVDRDYLA KLSGEFANEM LRIEGEIHAL AGQPFAIGSP RQLGEILFDK
MGLKGGRKGK SGDWSTDQNE LERLERDGVP IARKILEWRQ LAKLKSTYTD ALQEQVNATT
GRVHTSYSLV GAQTGRLSST DPNLQNIPIR TEVGRQIRDA FIAAPGHVLI AADYSQIELR
LAAHMADVPE LKEAFARGDD IHAATAIELF GEVNRDTRGK AKTVNFSILY GISRWGLAGR
LEITPDEAQA LISRYFERFP GISDYISDTL ETARARGYTE TLFGRKTWFP RIKAANQNER
AGSERAAINA PIQGTSADLI KRAMARMPGA LADAGLADVK MLLQVHDELV FEAPEDKAAA
AGEVIRAVMM GAAEPALKLS VPLEVEIGTG KSWGDAH