Gene Saro_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1770 
Symbol 
ID3918329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1864289 
End bp1867936 
Gene Length3648 bp 
Protein Length1215 aa 
Translation table11 
GC content65% 
IMG OID640444511 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_497044 
Protein GI87199787 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.183542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGC AGGCCGCGCC TCTCGCAAGA GCGCGGCCTT TTTCCTACCT TGTACACATG 
CCCCATGCCG CCTTCGTCCC GCTTCGCATC TTCTCGTCCT ACACGATGCT CGACGGCGCC
ATCGACCCGA AGGCTATCGC CAAGACCGCC GCCGAGCGGG GCTTTCCCGC CGCCGCCATC
ACAGATCGCA ACGGGCTCTA CGGTAGCGTG GCCTATGCCA AGGCGTGCAA GGACATGGGC
GTCCAGCCGG TCATCGGCAC GATGCTCGCC GTAGCCCGTC CCGAACGCGA CGGCGCGGCA
ACCGGCTTCG GTCCGGCCGC ACCGACGATC GACTGGCTGG CCCTCTATGC GCAGGATGCC
AGGGGCTATG ACAACCTTTG CCACCTCGTC AGCCGCGCCC ATCTCGACCG GCCGCTCGAA
TTTGCGCCGC ACGTCGTCCT GGCGGATCTC GTCGGCCATA CCGACGGCCT CATCTGCCTT
ACCGCCGCCA GCGAAGGCGC ACTGGCGCAG CTTCTTGCCG GAGGCCAGCA AAGCGCCGCC
GAGGCCTATG TCGACCGGCT GCTCGAACTG TTCCCCGAAC GGCTCTACAT CGAGATAGCC
CGGCGCAACG ATGCGGTGGA GGAAGCGTCC GAAACCGCGC TCATCGATCT CGCATACGCG
CGGGACCTGC CGCTGGTTGC CACGAACCCC GCCTGCTTTG CAGAGCGGAC TTTCTACGAT
GCGCACGATG CCATGCTCTG CATCGCCAGT TCCACCCATG TCGATAGCGC GGATCGCCCG
CGCTCCAGCA AGGAATGGTG GATCAAGCCC GCGGCGGTCA TGGAGGAACT GTTCAAGGAC
CTGCCCGAAG CGCTCGCCAA CACGCTGGTC GTGGCGCAGC GCTGCGCGGT CATGCCGCCC
AAGCGCAAGC CGATCCTGCC CAGCCTTGCC GGCGACCAGG AAGGCGAGGC GCGGATGTGC
GCCGAGGATA GCCGCAAGGG CCTCGTCCTG CGACTTGAAC CCTATTACCC GGAAAGCACC
CACGCCGAAC TCGCTCGCGT GCTCTGCCTT GGCCCCGATG CCGAACCGGT CGCCGCCGAC
CATCCGCAAC TGGTCGAAGC GGGCGTCTGG GAAGAAGTTC TCGACTACCG CAAGCGGCTC
GAGTTCGAGA TCGCGATCAT CAACCGCATG GGGTTCGGCG GCTACTTCCT GATCGTTGCC
GACTTTATCA AGTGGGCGAA GGAACACGGC ATTCCGGTCG GGCCGGGCCG CGGTTCGGGC
GCGGGTTCGC TCGTCGCTTG GGCGCTTACC ATCACCGACC TCGATCCGAT CAAGCTTGGT
CTTCTGTTCG AACGCTTCCT GAATCCGGAA CGCGTCTCGA TGCCGGACTT CGACATCGAC
TTCTGCGAAA CCCGCCGTGG CGAGGTGATC CGCTACGTCC AGCGCAAGTA CGGCGAGGAC
CACGTCGCCC AGATCATCAC CTTCGGCAAG CTCAAGGCCC GCGCGGTGCT GCGCGACACC
GGCCGTATCC TTCAGATGAG CTATGGCCAG ACTGACCGGC TCTGCAAGAT GGTCCCGAAC
CATCCGACCG ATCCCTGGCC GCTGCCCCGC GCCCTCAATG GCGTGGCCGA ACTCAAGCGC
GAATACGACC GCGACCCCGA GGTCAAGCGC CTGATCGACC TTGCGATGCA GCTCGAAGGC
CTGCCGCGCA ACAGTTCGAC CCACGCGGCG GGCGTCGTTA TCGGCGACCG CCCGCTGGCG
CAACTCGTGC CGCTCTATCG CGATCCGCGT TCGGACATGC CGGTCACGCA GTTCGACATG
AAGCACGTCG AGGATGCCGG CCTCGTCAAG TTCGACTTCC TCGGCCTCAA GACGCTGTCG
GTCCTGCGCA AGGCCGTAGA CCTGATGAAG AAGCGCAGCA TCGAGATCGA CCTCTCGGCG
CTGCCGTGGG ACGACGAGCA AACCTACAAG CTGCTCCAGT CGGGCGACAC CGTCGGCGTT
TTCCAGCTCG AATCCGAAGG CATGCGGCGC ACGCTCGCCG CGGTGAAGCC CACCAACTTC
GGCGACATCA TCGCTCTCGT CTCGCTTTAC CGTCCGGGCC CGATGGACAA CATACCATTG
TTCGGCCGCC GCAAGAACGG GCTCGAAGCG ATCGAGTATC CGCACGACAA GCTGGCCGGG
ATCCTGTCGG AAACCTACGG CATCTTCGTC TACCAGGAAC AGGTCATGCA GGCCGCGCAG
ATCCTCGCCG GATACTCGCT CGGCGACGCC GACCTCCTGC GCCGCGCCAT GGGCAAGAAG
GTCCAGGCGG AAATGGACGC GCAGCGCCAG CGTTTCGTCG ACGGCTGCAA GGAAGTGTCC
GGCATTCCCG CCGCAAAGGC CAACGAACTT TTCGACTTGA TCGACAAGTT CGCGGGCTAC
GGCTTCAACA AGTCGCACGC AGCCGCCTAC GCGCTGCTCG CCTACCAGAC CGCCTGGCTC
AAGACGCACT ATCCGCACGA ATTCTATGCG GCGGCGATGT GCTTCGACAT GCACCAGTCG
GAAAAGCTCT CGGTCTTCGT CGACGACATG CGCCGCAACG GCGTGGCGCT GGCCGGGCCC
GATATCAACC ACTCCGAGGC CGAGTTCACC GTCGAGCGGA CGCACGACGG CTATGCCGTG
CGATATGCCC TCGCGGGCCT GCGCAACGTG GGCGAGAAGG CGATGGAGCA GATCGTCGAG
GAGCGTGAGG CCAACGGCCC GTTCGCTTCG CTGGACGATC TCTTCCGCCG CATCCCGGCG
GGTTCGATGA ACCGTCGCCA GCTCGAAGCG CTCGCGGCCG GTGGCGCGCT CGATTGCCTC
GAGCCCAACC GCGCGCAGAT CATCGCCAAT GCCGAACTGC TCATGGCCGT GGCCGAGGAG
GCCTCGCGCT CGCGCACTTC GGGGCAGGGC GGCCTTTTCG GCGGCGACGA TCACGCCACT
CCCGCCACCC GCCTCTCCGA CACGAAACCG TGGAGCCGCG CGGACCAGAT GGCGGCGGAG
CGCGAGAATT TCGGCTTCTA CTTTGCCGCG CACCCGGTCG AGGAATACCG CGCCGTCGCC
TCGGCCAATG GTGCGCGCTC CTATGGCAGC CTGATGACGA GCAGTGGGGA GCCGGGCGGA
CGCTCGGGCG CCGTCATGGC CGCGCTGGTG GAGAACGTGC AGAAGCGCAA GACGAAGAAG
GGCAAGGACT TCGTCATGGC CGACTTCTCC GACAGCTCCG GCCTGTTCTC GGCCTCCTGC
TTCGAGGAAA GCCTGGTCGA ACCGTTCCAG CAGTGGGCGA GGGAGGGGAC CTGCGTTCTC
CTCAACGTCG AACTCGACCG CCCCAATCCG GATGAGCCGC CGCGCGTCAC CGTGCGCGGC
GCCCGCCCGC TCGCCTCGGT CACCAGCGCC TCGCGCATGG TGCTCAAGCT CGATGTCAGC
AGGACGGAGG CGATTTCCGA CCTCGCGATG CTCTTGCCGC GGCGGCCTGA CGGCAAGGGC
GAGGTGCTTG CCCGCCTTCG TACCGGCGGG CCAAAGGAGC CTCTCGTGCG CCTCGGCAAT
GATTTCATAC TCGATAGTGA CTTAATCGAA CGATTGATTC CGATAGAAGG ACTCGCCAAT
GTGGCATTGA CCGCAAGACC GGAAAGGCAT CTGCGGTTGG TCGAATAG
 
Protein sequence
MKKQAAPLAR ARPFSYLVHM PHAAFVPLRI FSSYTMLDGA IDPKAIAKTA AERGFPAAAI 
TDRNGLYGSV AYAKACKDMG VQPVIGTMLA VARPERDGAA TGFGPAAPTI DWLALYAQDA
RGYDNLCHLV SRAHLDRPLE FAPHVVLADL VGHTDGLICL TAASEGALAQ LLAGGQQSAA
EAYVDRLLEL FPERLYIEIA RRNDAVEEAS ETALIDLAYA RDLPLVATNP ACFAERTFYD
AHDAMLCIAS STHVDSADRP RSSKEWWIKP AAVMEELFKD LPEALANTLV VAQRCAVMPP
KRKPILPSLA GDQEGEARMC AEDSRKGLVL RLEPYYPEST HAELARVLCL GPDAEPVAAD
HPQLVEAGVW EEVLDYRKRL EFEIAIINRM GFGGYFLIVA DFIKWAKEHG IPVGPGRGSG
AGSLVAWALT ITDLDPIKLG LLFERFLNPE RVSMPDFDID FCETRRGEVI RYVQRKYGED
HVAQIITFGK LKARAVLRDT GRILQMSYGQ TDRLCKMVPN HPTDPWPLPR ALNGVAELKR
EYDRDPEVKR LIDLAMQLEG LPRNSSTHAA GVVIGDRPLA QLVPLYRDPR SDMPVTQFDM
KHVEDAGLVK FDFLGLKTLS VLRKAVDLMK KRSIEIDLSA LPWDDEQTYK LLQSGDTVGV
FQLESEGMRR TLAAVKPTNF GDIIALVSLY RPGPMDNIPL FGRRKNGLEA IEYPHDKLAG
ILSETYGIFV YQEQVMQAAQ ILAGYSLGDA DLLRRAMGKK VQAEMDAQRQ RFVDGCKEVS
GIPAAKANEL FDLIDKFAGY GFNKSHAAAY ALLAYQTAWL KTHYPHEFYA AAMCFDMHQS
EKLSVFVDDM RRNGVALAGP DINHSEAEFT VERTHDGYAV RYALAGLRNV GEKAMEQIVE
EREANGPFAS LDDLFRRIPA GSMNRRQLEA LAAGGALDCL EPNRAQIIAN AELLMAVAEE
ASRSRTSGQG GLFGGDDHAT PATRLSDTKP WSRADQMAAE RENFGFYFAA HPVEEYRAVA
SANGARSYGS LMTSSGEPGG RSGAVMAALV ENVQKRKTKK GKDFVMADFS DSSGLFSASC
FEESLVEPFQ QWAREGTCVL LNVELDRPNP DEPPRVTVRG ARPLASVTSA SRMVLKLDVS
RTEAISDLAM LLPRRPDGKG EVLARLRTGG PKEPLVRLGN DFILDSDLIE RLIPIEGLAN
VALTARPERH LRLVE