Gene Saro_1538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1538 
Symbol 
ID3917213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1584662 
End bp1587898 
Gene Length3237 bp 
Protein Length1078 aa 
Translation table11 
GC content63% 
IMG OID640444279 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_496813 
Protein GI87199556 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGGT CCCAGCCCTT TGCCGAACTG GTCTCTGCCA CCAATTACAG CTTCCTGCGT 
GGGGCTTCGC ATCCCGCGCA GATGGTCGCG CGGGCACACC AGCTTGGCAT GGCAGGATTG
GGCGTGGCCG ATCGCAATAC CGTTGCCGGA GTGGTGCGCG CGCACGTGGG CTGGCGCGAA
GTAGGGGGAC TTGCCAGTGG CTTCCGATTG GCGGTCGGAG CCAGGCTGGT CTTTGCCGAC
GGAACGCCGG ATGTGGTGGC CTATCCCTCC ACGCTTCGTG GCTGGCGCCG GCTGACGCGG
CTGTTGACGC TTGGCAACAG GCGGGCGCAA AAAGGCGATT GCACGCTCTA CCTGTCGGAT
CTCCTGGACC ATGCCGACGA TCTTTTGCTG ATCGCCACCG GTCACGATCG GCGGGTGCTG
GAAAAACTGA GGGAAGCACG TCCCGGCGCG GTGTGGCTGG CCGTTACGAT GCAGCGTGGC
GGCACCGACG CACGGCGGTT GGCCCAGGCA CAGGCTCTGG GCGCGGCGTG TGGCGTGCCC
CTTCTGGCCA CCTGCGATGC GCTCTATGAC CATCCCGATG CGCGGCCGCT GCACGATGTT
CTGACGTGCA TTCGCGAAGG CGTGACCATT GCCGAGGCAG GGCAACGTCT GCTCGCCAAT
GCGGAACGCC ATCTCAAACT CCCTGCGGAA ATGGCTCGCC TTTTTCGGAG TCATCCCGAA
GCCGTGACGG CCAGCGTCGA GATATTGTCG CGTATCACCT TCACGCTCGA CGACCTGCGC
TACGAATATC CGCACGAGCC AGTACCCGAG GGCTGGCAAC CGCAGGCGTG GCTGGAACAT
CTGGTGCAAC AAGGGGGGGA GGCGCTGTTC CCCGATGGCT TGCCGGAGAA CTACCAGGCC
GTGCTGAGGC AAGAGTTTTG CCTGATCCGG AAGAAGAACT ACGCTTACTA TTTCCTCACC
GTGCACGATA TCGTGCGCCA CGCTCGCAGC CTCGATCCGC CGATCCTGTG CCAGGGCAGA
GGCAGTGCGG CCAATTCGCT GGTCTGCTAT TTCCTGGGTA TCACCCCGAT CGACCCGGTG
CGCGAACAGC TGCTGTTCTC GCGTTTCCTG TCCGAAATGC GCGACGAGCC GCCCGATATC
GACGTGGACT TCGAGCATGA GCGCCGCGAG GAGATCATGC AGTACATCTA TGCTCGCTAT
GGCCGCGAAC GGGCGGGCAT CGCCGCCACC GTGATCCACT ATCGTCCACG CAGTGCCATC
CGCGAAGTGG GCAAGGTGCT GGGGCTGACC GAGGACGTGA CGGCGCGGCT CGCGTCTACC
ATCTGGGGCA ATTGGGGCAA GGATGTGCCG GAAGCCTACG TGGCGGAGGC CGGGCTGGAT
ATTGCCAACC CCGTGGTCGC ACGGCTGAAG GCGCTGGTCG ATCAGTTGCT GACCTTTCCC
CGCCATCTCT CGCAGCATGT CGGCGGCTTC GTACTGACCG AGTCACGGCT CGACGAACTG
GTACCGATCC ACAATGCCGC GATGCCTGAC CGCACGTTCA TCGAATGGGA CAAGGACGAT
ATCGACGCGC TGGGGCTGAT GAAGGTCGAC ATTCTGGCGC TGGGAATGCT CACCGCCATC
CGCAAGAGCT TCGACCTGCT GCGCAGGCAC GACGTGGCAG ACCTGACGCT GGCCTCCGTC
CCGCACGACG ACAGGGCGAC GTATGCTATG CTCAAGCGGG GTGACAGCAT CGGCACCTTC
CAGGTCGAGA GCCGCGCGCA GATTGCCATG CTGCCCCGCA TGAAGCCCGA CTGTCTCTAC
GATCTGGTCA TCCAGGTCGC GATTGTCCGC CCCGGGCCGA TTCAGGGCGG CATGGTCCAT
CCCTACCTGC GGCGGCGCAA CGGCGAGGAA GCGGCGGTCT TTCCCAGTCC CGCGCCGCCT
CACGATCCGC ATGAGTTGCA CAAGGTACTG GGCAAGACGC TGGGTGTGCC GCTGTTTCAG
GAACAGGCGA TGAAGCTGGC CATCGTCGCG GCCCGTTTCA CCCCCGCACA GGCCGATGGC
CTCAGGCGCG CAATGGCCAC GTTCCGTCAC CATGGGACGG TCCACAACTA CGAAGACCTG
CTGATCGATG GCATGGTGGC ACGCGGCTAT GAACGTGAAT TCGCCGAACG CTGCTATGAA
CAGATCAAGG GGTTCGGCGA ATATGGCTTT CCCGAAAGCC ACGCGCAGGC TTTCGGGTGG
CTGGCCTATG TCTCGTCATG GCTCAAATGC CACCACCCGG CCGCTTTCAC CTGTGCACTG
CTCAACAGTC AGCCGATGGG CTTCTATGCC CCGGCGCAAC TCGTTCGCGA TGCGCAACAA
CATGGCGTGG AAGTCCGGGC GGTGGATGTG AACGCCAGTC ATTGGGATTG TACGCTGGCA
GGGGGAAAGG GTGGCCCATT TGCGCTGCAG CTTGGCTTTT CCCGTATCGA CGGTTTCCGC
AGGGCTTGGG CCGAAGCGTT GGCGCAGGCC CGCCACGATG GCATGTTTAC CTCGATCGAG
GACGTTGCCC GCCGTGCCGA GATGCCCCCT CAGGCCTTGC GCAAGCTAGC CGATGCCGAT
GCGTTCGGAT CTCTGGCATC CTCGCGCCGT GATGCGCTGT GGGATGTCCG CCGCACGCCC
CCCACGCAAC TACCCTTGTT CGCCTTTGCC AATGCTCCGG AGCTCGGGCA GGAACCCGAT
GCCCGCCTTT CCGCGATGCC GCTTCCCGAA GAAGTGGTGG CCGATTACCA GACCACACGG
CTGTCGCTGA AGGGTCACCC CATGCAATTC CTGCGGGATG CGCTGGCCCG CAAAGGCGTG
CTGTCCTGCG CTGAAGCCAA TACCGCTGCG GATGGTCGCA AGGTCCGCGT CGCCGGTGTT
GTTCTTACCC GCCAGCGTCC TGGCAAGGGC AATGCGGTGT TCATCACGAT TGAAGATGAA
ACCGGCGTGG TCAACGCACT GCTCTGGGCG CGCGATCTCG ACAAGCAGCG CCGCGCCGTC
ATGGCCGCTC GCTTGATGCT GATCGAAGGC GAAATTCAGA AGAGCAAGGA AGGCGTGGTC
CACCTTATGG CCGCGCGGAT CATGGATCGT ACCGACCTTC TCGACCGCCT GACAGATCGG
GAGCAGGTGA GGCCCGCTGT TTCACGCGGC GACGAGGTCC ATCGCCCCCA ATACCCCCGC
CTGCATGCGC ATCCACGCGA TGTAAGGGTC CTGCCCAAAT CCCGGGATTT TCGCTGA
 
Protein sequence
MTGSQPFAEL VSATNYSFLR GASHPAQMVA RAHQLGMAGL GVADRNTVAG VVRAHVGWRE 
VGGLASGFRL AVGARLVFAD GTPDVVAYPS TLRGWRRLTR LLTLGNRRAQ KGDCTLYLSD
LLDHADDLLL IATGHDRRVL EKLREARPGA VWLAVTMQRG GTDARRLAQA QALGAACGVP
LLATCDALYD HPDARPLHDV LTCIREGVTI AEAGQRLLAN AERHLKLPAE MARLFRSHPE
AVTASVEILS RITFTLDDLR YEYPHEPVPE GWQPQAWLEH LVQQGGEALF PDGLPENYQA
VLRQEFCLIR KKNYAYYFLT VHDIVRHARS LDPPILCQGR GSAANSLVCY FLGITPIDPV
REQLLFSRFL SEMRDEPPDI DVDFEHERRE EIMQYIYARY GRERAGIAAT VIHYRPRSAI
REVGKVLGLT EDVTARLAST IWGNWGKDVP EAYVAEAGLD IANPVVARLK ALVDQLLTFP
RHLSQHVGGF VLTESRLDEL VPIHNAAMPD RTFIEWDKDD IDALGLMKVD ILALGMLTAI
RKSFDLLRRH DVADLTLASV PHDDRATYAM LKRGDSIGTF QVESRAQIAM LPRMKPDCLY
DLVIQVAIVR PGPIQGGMVH PYLRRRNGEE AAVFPSPAPP HDPHELHKVL GKTLGVPLFQ
EQAMKLAIVA ARFTPAQADG LRRAMATFRH HGTVHNYEDL LIDGMVARGY EREFAERCYE
QIKGFGEYGF PESHAQAFGW LAYVSSWLKC HHPAAFTCAL LNSQPMGFYA PAQLVRDAQQ
HGVEVRAVDV NASHWDCTLA GGKGGPFALQ LGFSRIDGFR RAWAEALAQA RHDGMFTSIE
DVARRAEMPP QALRKLADAD AFGSLASSRR DALWDVRRTP PTQLPLFAFA NAPELGQEPD
ARLSAMPLPE EVVADYQTTR LSLKGHPMQF LRDALARKGV LSCAEANTAA DGRKVRVAGV
VLTRQRPGKG NAVFITIEDE TGVVNALLWA RDLDKQRRAV MAARLMLIEG EIQKSKEGVV
HLMAARIMDR TDLLDRLTDR EQVRPAVSRG DEVHRPQYPR LHAHPRDVRV LPKSRDFR