Gene Saro_0278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0278 
Symbol 
ID3917631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp296295 
End bp299132 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content67% 
IMG OID640443007 
ProductDNA polymerase I 
Protein accessionYP_495560 
Protein GI87198303 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAAA AGCAGCATCT CTACCTGGTC GACGGTTCGG CCTACATCTT CCGCGCCTAT 
CACCGCCTGC CCCCGCTGAC GAACCCGCGC GGCGTGCCTG TCGGCGCGGT CTATGGCTAT
ACGACGATGC TGTGGAAACT GGCCGAGGAC CTGCACAAGG CGGACGGGCC GACGCACCTG
GCGGTCATCC TCGACAAGGC GGGCACCTCG TTCCGCAACG ACCTGTACGA CCAGTACAAG
GCCAATCGTC CGCCGCCGCC CGAAGATCTC GTCCCGCAGT TCCCGCTGAT CCGCGATGCC
ACGCGCGCCT TCAGCCTCGC CTGCATCGAG GAGGACGCGC TCGAGGCGGA CGACCTCATC
GCCTCCTACG CCCGCGCCGC CACGCTGCGC GGCTGGGACG TGACCATCGT TTCATCGGAC
AAGGACCTGA TGCAGCTTGT CGGCAAGTGC GCGACCGGTG GCGGGTGCAT CGACATGCTC
GACACGATGA AGAACCAGCG CATCGACATT CCCGAGGTGG TCGAGAAGTT CGGCGTGCCC
CCGGAAAAGG TCGGCGACGT GCTGGCGCTG ATGGGCGATT CGGTCGACAA CGTGCCCGGC
ATCCGCGGCA TCGGCCCGAA GACCGCGACC AAGCTCATCC AGGAACATGG CGATCTCGAA
TCCGCGCTTG CCGCCGCGCC GACGATGAAG GCGGGCAAGC TGCGCGAAAG CCTGATCGAG
CAGGCGGAAA TGGCCCGCCT CTCGCGCGTG CTGGTGGCGC TCAAGGAAGA CTGCCCGCTG
CCGGTCCCGA TCGAGGACTT CAAGCTCGGC CAGATCCCGC GCGATCCGCT CGCCGCGTTT
CTCGAGGAAC ACGGCTTCAC CAGCCTGCTG CGGCGGCTGG ACGACGGCAA GGGCAGCCCC
GGGCCGAAGA CCCAGCTCAA CCCCGCCAAG CCCGACAACA AGGCTGCGCC CGCCGCGCCC
GAAGGCAACC GCCAGGCGCT ACCCTCGTGG CCCGCGCCCG ACCTCTCGGC CTATGCCTGC
GTCCAGTCGA TGGGCGAACT GCAGCAGTGG ATCGCCCGCG CCTTCGAAGC GCGGGTCGTC
GCCATCGATA CCGAGACCAG CGCGCTCGAC GCGATGCGCG CCGACCTGGT GGGCGTAAGC
CTCGCGCTGG GCCCCAACGA TGCGTGCTAC ATCCCGCTTG GCCACGGCGG CTCCGACATG
TTCGCGGAAA AGCCCCTGCA GGTGGACCGT GCCGAGGCGC TCGCCGCGCT CAAGCCGCTG
CTGGAAAGCG ACGCGGTGCT CAAGGTCGGC CAGAACATCA AGTACGACCT CAACATCCTC
GCCCGGCAGG GGATCATGGT GTCACCCATC GACGACACGA TGGTGATGAG CTTCTGCCTC
GATGCGGGGC GCAGCGAGAC AGGTCTTGCC GGGCACGGGA TGGACGAGCT GTCCGAGCGG
CACCTGCTGC ACACGACGAT GAAGTTCAAG GAGCTGTGCG GCACCGGCAA GAAGGCGATC
TCCTTTGCCG AAGTGCCGCT GGCCGACGCC ACGCGCTATG CCGCCGAGGA CGCGGACGTG
ACGTGGCGCC TGCACCGCCT GTTCCAGCCG CGGCTGGCCG GGGAAGGCGG CACGCGCATC
TACCAGAAGG TCGACCGGCC GCTGATCCCG GTCGTCGCGA TGATGGAACG CAACGGCATC
AAGGTCGACC GCGAGCGACT GTCCGGCCTG TCGACCGAGT TCGCCGCCCA GATCGCGCGC
CTCGAAGGCG TGATCCACGA GAAGTGCGGC ATGTCCTTCA CCATCGGCAG CCCCAAGCAG
CTGGGCGAAG TGCTGTTCGA CAAGCTGGGC TACAAGGGCG GCAAGAAGGG CAAGACCGGG
CAGTATTCGA CCGACCAGTC GGTGCTGGAA GGCCTTGCCG CGCAGGGCGC CGAAGTGGCG
ACACTGGTGC TGGAATGGCG CCAGCTCTCG AAGCTCAAGT CGACCTATAC CGACGCGCTG
CAGGCCGCGA TCAACCCGCA CACCGGCCGC GTCCACACCA GCTACAGCCT GGTGGGCGCG
CAGACCGGCC GCCTGTCTTC AAACGAGCCT AACCTCCAGA ACATCCCGAT CCGCACCGAG
ATAGGCCGCC AGATCCGCGA TGCCTTCGTC GCCGAGCCCG GAAACGTGCT GCTCGCGGCG
GACTACAGCC AGATCGAACT GCGCCTTGCC GCGCACATGG CCGACGTGCC CGCGCTCAAG
GAGGCGTTCG CCGCCGGCGA GGACATTCAC GCGCGTACTG CGATGGAAAT GTTCGGCACG
GTCGACCGCG ATACGCGCGG ACGGGCCAAG ACGATCAACT TCGCTATCCT CTACGGCATC
AGCCGGTGGG GCCTCGGCGG ACGGCTGGGC GTCAGCGCGG ACGAGGCGCA GGCGATGATC
GACCGCTACT TCGAACGCTT CCCCGGCATC CAGCGCTACA TCCACGAGAC ATTGGAGAGT
GTGCGCGAGC GCGGCTATTC CGAGACGCTG TTCGGCCGCA AGACCTGGTT CCCGCGCATA
AGCTCGAAGA ACCAGGCCGA ACGCCAGGGC AGCGAGCGCG CTGCGATCAA CGCGCCGATC
CAGGGCACCA GCGCCGACAT CATCAAGCGC GCGATGGCCC GCATGATGCC CGCGCTGACC
GAAGCGGGCC TCGGCCACGT GCGCATGCTG CTGCAGGTCC ACGACGAACT CGTGTTCGAA
CTGCCCCAGG GCGACGTCGA GGCGGCGAAG CCCGTGATCG AGCGCGTGAT GGCGACGGCG
GCAGAACCTG CGGTCAAGCT CAGCGTGCCG CTGGGCGTCG AGATCGGCAC CGGCCTGTCG
TGGGGCGCGG CGCACTGA
 
Protein sequence
MDEKQHLYLV DGSAYIFRAY HRLPPLTNPR GVPVGAVYGY TTMLWKLAED LHKADGPTHL 
AVILDKAGTS FRNDLYDQYK ANRPPPPEDL VPQFPLIRDA TRAFSLACIE EDALEADDLI
ASYARAATLR GWDVTIVSSD KDLMQLVGKC ATGGGCIDML DTMKNQRIDI PEVVEKFGVP
PEKVGDVLAL MGDSVDNVPG IRGIGPKTAT KLIQEHGDLE SALAAAPTMK AGKLRESLIE
QAEMARLSRV LVALKEDCPL PVPIEDFKLG QIPRDPLAAF LEEHGFTSLL RRLDDGKGSP
GPKTQLNPAK PDNKAAPAAP EGNRQALPSW PAPDLSAYAC VQSMGELQQW IARAFEARVV
AIDTETSALD AMRADLVGVS LALGPNDACY IPLGHGGSDM FAEKPLQVDR AEALAALKPL
LESDAVLKVG QNIKYDLNIL ARQGIMVSPI DDTMVMSFCL DAGRSETGLA GHGMDELSER
HLLHTTMKFK ELCGTGKKAI SFAEVPLADA TRYAAEDADV TWRLHRLFQP RLAGEGGTRI
YQKVDRPLIP VVAMMERNGI KVDRERLSGL STEFAAQIAR LEGVIHEKCG MSFTIGSPKQ
LGEVLFDKLG YKGGKKGKTG QYSTDQSVLE GLAAQGAEVA TLVLEWRQLS KLKSTYTDAL
QAAINPHTGR VHTSYSLVGA QTGRLSSNEP NLQNIPIRTE IGRQIRDAFV AEPGNVLLAA
DYSQIELRLA AHMADVPALK EAFAAGEDIH ARTAMEMFGT VDRDTRGRAK TINFAILYGI
SRWGLGGRLG VSADEAQAMI DRYFERFPGI QRYIHETLES VRERGYSETL FGRKTWFPRI
SSKNQAERQG SERAAINAPI QGTSADIIKR AMARMMPALT EAGLGHVRML LQVHDELVFE
LPQGDVEAAK PVIERVMATA AEPAVKLSVP LGVEIGTGLS WGAAH