Gene SeD_A4388 details


Gene Information

Locus tag: SeD_A4388
Symbol:
ID: 6875628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4233082
End bp: 4235868
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 54%
IMG OID: 642787311
Product: DNA polymerase I
Protein accession: YP_002217922
Protein GI: 198245003
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
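
The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 4233082, End bp 4235868.
gene_bp, protein_aa = cds_lengths(4233082, 4235868)
print(gene_bp, protein_aa)  # 2787 928
```

Under these assumptions the result agrees with the listed Gene Length (2787 bp) and Protein Length (928 aa).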


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00514081
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCAGA TCCCAGAAAA CCCACTTATT CTCGTAGATG GCTCATCCTA TCTCTATCGC 
GCCTATCATG CGTTTCCGCC GTTAACCAAC AGCGCGGGAG AACCTACGGG CGCAATGTAT
GGTGTCCTCA ACATGTTGCG CAGCCTGATC ATGCAGTATC AGCCGACGCA TGCTGCGGTG
GTGTTTGACG CCAAAGGAAA AACCTTCCGT GATGAGCTCT TTGAACACTA CAAATCGCAT
CGTCCTCCGA TGCCGGATGA TCTGCGAGCG CAAATAGAGC CGTTACATGC CATGGTTAAA
GCCATGGGGT TACCTCTGCT GGCAGTCTCT GGCGTAGAAG CGGATGACGT TATCGGTACA
CTGGCGCGAG AAGCGGAGAA GGTGGGGCGT CCGGTATTAA TCAGCACCGG CGATAAAGAT
ATGGCACAGT TGGTGACGCC GAATATTACG CTGATCAACA CCATGACTAA CACCATCCTC
GGCCCGGATG AAGTCGTTAA TAAGTACGGC GTGCCGCCTG AGCTGATTAT CGACTTTCTG
GCGCTGATGG GGGACTCCTC GGATAATATT CCAGGCGTAC CAGGCGTGGG TGAGAAGACG
GCGCAAGCCT TGCTTCAGGG ATTGGGCGGC CTGGATACGC TGTACGCCGA GCCGGAAAAA
ATTGCCGGTC TCACTTTCCG CGGCGCCAAA ACGATGGCCG GTAAATTAGC GCAGAATAAA
GACGTAGCGT ACCTGTCTTA TAAACTCGCC ACCATTAAAA CGGATGTTGA GCTGGAGCTG
ACCTGCGAAC AGCTTGAAGT GCAGCAGCCG ATTGCGGATG AACTGCTGGG CCTGTTTAAA
AAATATGAGT TCAAGCGCTG GACGGCGGAC GTCGAGGCAG GCAAGTGGCT ACAGGCAAAG
GGCGCGAAAC CGGCGGCCAA ACCGCAGGAA ACGGTCGTTA TTGATGAATC GCCCAGCGAA
CCGGCAGCGG CGCTCTCTTA TGAAAATTAT GTCACGATTC TGGACGACGT TACGCTGGAA
AGCTGGATTG AAAAGCTGAA AAAAGCGCCA GTTTTTGCTT TCGACACGGA GACCGACAGT
CTGGATAATA TCGCCGCCAA CCTGGTGGGC CTCTCGTTTG CTATCGAACC TGGCGTTGCC
GCGTATGTAC CTGTCGCGCA TGATTATCTG GACGCTCCGG ATCAAATCTC CCGCCAGCGT
GCTCTGGAAC TGCTGAAGCC GCTGCTGGAA GATGAAAAAG TGCGCAAAGT GGGGCAAAAC
CTCAAGTACG ATCGCGGCGT CTTGCAAAAT TACGGTATTG AGCTGCGCGG TATCGCCTTC
GATACCATGC TTGAGTCTTA CATTCTGAAC AGCGTCGCCG GACGCCATGA TATGGACAGT
TTGTCCGATC GTTGGCTGAA GCACAAAACT ATCACCTTTG AAGACATTGC CGGTAAAGGT
AAAAACCAGC TCACCTTTAA CCAGATCGCA CTGGAGGAAG CGGGGCGCTA TGCGGCAGAA
GATGCGGATG TCACTTTACA GTTGCATCTC AAAATGTGGC CTGAGCTCCA GCAGCACAAA
GGCCCGCTGA ATGTTTTCGA AAACATCGAA ATGCCGTTGG TGCCTGTACT GTCACGCGTT
GAGCGCAATG GCGTAAAAAT CGATCCTGCC GTCCTGCACA AACATTCGGA AGAAATCACG
CTACGTCTGG CGGAACTGGA AAAGAAAGCG CATGACATTG CGGGCGAGGC GTTCAACCTG
TCCTCGACGA AGCAGTTGCA GACCATCCTG TTTGAAAAGC AGGGTATTAA GCCGCTGAAG
AAAACGCCTG GCGGCGCGCC GTCAACGTCG GAAGAGGTGC TGGAAGAGCT GGCGCTGGAC
TATCCGCTGC CGAAAGTGAT TCTGGAGTAT CGTGGTCTGG CGAAGCTAAA ATCCACCTAT
ACCGATAAGC TGCCGCTGAT GATTAACCCG AAAACCGGGC GCGTCCATAC GTCCTATCAT
CAGGCGGTAA CGGCGACGGG ACGTTTATCG TCCACCGATC CGAACCTGCA AAATATTCCG
GTGCGCAATG AAGAAGGCCG CCGCATTCGT CAGGCATTTA TTGCGCCTGA GGATTATCTC
ATCGTGTCTG CGGACTATTC ACAGATTGAG CTGCGTATTA TGGCGCATCT TTCCCGTGAT
AAAGGACTGC TCACGGCGTT CGCCGAAGGG AAGGATATTC ACCGCGCAAC GGCGGCGGAA
GTCTTTGGCT TGCCGCTGGA TAGCGTGACC GGGGAACAGC GCCGAAGTGC GAAAGCCATT
AACTTTGGCC TGATTTACGG GATGAGCGCC TTCGGTCTTT CTCGCCAGCT TAATATTCCG
CGTAAAGAGG CGCAGAAGTA TATGGATCTC TACTTCGAAC GCTACCCTGG CGTGCTGGAA
TATATGGAGC GCACCCGCGC TCAGGCAAAA GAACAAGGCT ATGTGGAAAC GCTGGAGGGA
CGCCGCCTTT ACCTGCCGGA TATTAAATCT AGCAACGCGG CGCGGCGCGC GGGGGCGGAA
CGCGCGGCGA TCAATGCTCC CATGCAAGGA ACGGCTGCCG ATATCATCAA GCGCGCCATG
ATTGCCGTCG ATGCCTGGCT ACAGGCCGAG CAGCCACGCG TGCGGATGAT TATGCAGGTA
CACGATGAAT TAGTGTTCGA GGTGCATAAA GACGACTTAG ATGCGGTAGC AAAACGTATC
CATCAGTTGA TGGAAAACTG CACGCGTATT GATGTGCCGT TGCTGGTGGA AGTCGGTAGC
GGAGAAAATT GGGATCAAGC GCACTAA
 
Protein sequence
MVQIPENPLI LVDGSSYLYR AYHAFPPLTN SAGEPTGAMY GVLNMLRSLI MQYQPTHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKVGR PVLISTGDKD MAQLVTPNIT LINTMTNTIL GPDEVVNKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAEPEK IAGLTFRGAK TMAGKLAQNK
DVAYLSYKLA TIKTDVELEL TCEQLEVQQP IADELLGLFK KYEFKRWTAD VEAGKWLQAK
GAKPAAKPQE TVVIDESPSE PAAALSYENY VTILDDVTLE SWIEKLKKAP VFAFDTETDS
LDNIAANLVG LSFAIEPGVA AYVPVAHDYL DAPDQISRQR ALELLKPLLE DEKVRKVGQN
LKYDRGVLQN YGIELRGIAF DTMLESYILN SVAGRHDMDS LSDRWLKHKT ITFEDIAGKG
KNQLTFNQIA LEEAGRYAAE DADVTLQLHL KMWPELQQHK GPLNVFENIE MPLVPVLSRV
ERNGVKIDPA VLHKHSEEIT LRLAELEKKA HDIAGEAFNL SSTKQLQTIL FEKQGIKPLK
KTPGGAPSTS EEVLEELALD YPLPKVILEY RGLAKLKSTY TDKLPLMINP KTGRVHTSYH
QAVTATGRLS STDPNLQNIP VRNEEGRRIR QAFIAPEDYL IVSADYSQIE LRIMAHLSRD
KGLLTAFAEG KDIHRATAAE VFGLPLDSVT GEQRRSAKAI NFGLIYGMSA FGLSRQLNIP
RKEAQKYMDL YFERYPGVLE YMERTRAQAK EQGYVETLEG RRLYLPDIKS SNAARRAGAE
RAAINAPMQG TAADIIKRAM IAVDAWLQAE QPRVRMIMQV HDELVFEVHK DDLDAVAKRI
HQLMENCTRI DVPLLVEVGS GENWDQAH
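
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of how one might translate the space-separated gene sequence and compute its GC percentage. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and are not part of the source database. The sketch also assumes the sequence is given 5' to 3' on the coding strand and contains only A, C, G, and T.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order; translation table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGTTCAGA TCCCAGAAAA CCCACTTATT"))  # MVQIPENPLI
print(translate("GGAGAAAATT GGGATCAAGC GCACTAA"))     # GENWDQAH
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.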