Gene Synpcc7942_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0194 
Symbol 
ID3775802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp192655 
End bp195516 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content57% 
IMG OID637798600 
ProductDNA polymerase I 
Protein accessionYP_399213 
Protein GI81299005 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAG ACTCTCCCCT CTTGCTTCTC GTGGATGGCC ACTCCTTGGC CTTCCGCAGT 
TACTACGCTT TTGCCAAAGG TCGCGATGGT GGGTTACGCA CCCGCACCGG TATTCCCACC
AGTGTCTGTT TTGGCTTCCT CAAAGCGCTG CTGGAGGTGA TGGAGCAACA GCAACCCAAA
GCGTTGGCGA TCGCCTTCGA TTTGGGTGGG CCAACCTTCC GCCACGAAGC AGACGAGAAC
TACAAGGCCA ACCGCGACGA AGCCCCCGAA GACTTCAAGA TCGATACGGA TAACCTCGTC
GCGCTGCTGC AAACCCTCAA TCTGCCAATT CTGGTGGAGC CAGGCTATGA GGCAGATGAC
CTACTCGGCA CAGTCGCCCA ACGCGGAGCC GAAGCGGGCT ATCGAGTACG GATTCTCAGC
GGCGATCGCG ATCTCTTCCA GCTCGTTGAC CCCGAGGGTG CGATTCGCGT GCTCTACCTC
GGCAATACCT TTGGCCGCAG CGCTAATCGA GAAGCCGCCC GTGAGATTGA TCCGGCCGCG
GTGATCGATA AGCTGGGTGT ACCGCCTGAG CAAGTAATTG ATTTCAAAGC GCTTTGCGGT
GATAGCTCTG ATAACATCCC TGGCGTCAAA GGCATTGGTC CCAAAACAGC CGTGGATCTG
CTGCAGGCTT GGGGTGATCT CGATCGCATT TACGACAATC TGGAAGCGAT CAAACCCGCC
GTTCGCAAGA AATTAGAGAG CGATCGCGAG GCAGCCTATC ACTCCCGCAA ACTGGCGCAA
ATCGTCACGG ATATTCCCCT AGCGATCGAC TGGGATCACT ATGCTCTGAC CGGCTTTGAT
GAACAGGAAG TCTTGCCTTG GCTGGAAAAA CTGGAACTGC AAGCCTTCCG GCGACAGGTC
GATCGCCTGC AACAGCTCTT CGGGGGCCAG CCACCAACTG TCGAAGCACT CAGTGATGAA
TCCCTCGACT TTTGGACCGC CGAAGAAACT GCTGCGCAGC AACCGCGCTG GCCACAACTG
CAGCCTCAGA TCATCACCAC GGCCGCAGCG CTGACTGACC TCGTGACTTT GCTGGAACAG
CGCGATAGTC CTGAAGCGAT CGTGGCTTGG GACACGGAAA CAACCGACCT CGATCCGCGC
TTGGCGCAAC TGGTGGGTAT TGGCTGCGCT TGGGGAGAAG AGCCAGATCA GCTTGCCTAC
ATTCCCCTCG GGCATGAGGA AGGCGAGCAA TTGCCGCTCC AGACAGTCCT CACCGCGCTA
CGACCGATTT TGGAAAGCGA TCGCCATCCC AAGGCGCTGC AAAATGCCAA GTTCGATCGC
CTGATTCTGC GTCACCAAGG CATTGAGCTG GCGGGTGTGG TCTTTGACAC AATGCTGGCT
AGCTACCTAC TCAACCCCAG TCTTGGCCAC AGCTTAGATG CTTTAGCCGA TCGCTGGTTA
AAGCTGCAAA CGCGCAGCTA CAGCGACCTC GTCCCCAAGG GCAAAACCAT CGCCCAAGTC
GCGATCGCGG CAGTTGCACA ATACTGCGGC AGCGATGTCC ATGTGGTGCA GCGGCTGATT
CCGTTGCTGA AAGCTGGGAT CGCCGAATCT CCGGCGCTTC AATTCCTGCT GGAAACCGTC
GAGCTGCCGC TCGAAGCCGT GCTTGCCGAG ATGGAAGATC GCGGCATTCG CATTGATGAA
GGATATCTAG CAGAACTGTC GGAGCATCTC AAGGGCGAGC TCGATCGCTT GGAAGGAGCA
GCGCATACCC TAGCGGGCGA TCGCTTTAAT CTCGGGTCGC CCAAGCAGCT GAGCGAACTG
CTGTTCGAGA AGCTAGGGCT GAATGTCAAA AAGTCCCGCA AGACCAAAAC GGGCTACTCC
ACCGATGCAG CCGTGCTCGA AAAACTCCAG GGTGATCACC CGATCATCGA CCTGATTCTG
GAGCACCGCA CCCTCGCCAA ACTGAAGTCG ACCTATGTGG ATGCGCTGCC GAGTCTGGTT
GCTGCCGATG GACGCATTCA TACTGATTTC AACCAAGCGG TGACGGCGAC GGGACGGCTG
TCCTCTTCTA ATCCCAACCT GCAGAACATT CCGATTCGCA CCGAATTCAG CCGTCAAATT
CGCAAGGCTT TTCTGCCCCG TGAAGGCTGG CTACTAGCCG CAGCAGATTA CTCGCAAATT
GAGCTGCGCA TCCTCGCTCA CCTCAGCCAA GAGCCAGTGC TGCTGGAAGC CTACCGGCAG
GGCGATGATG TACACCGACT TACGGCCAGT CTGCTCTTCG ATCGCGAGGA GATCACGTCC
GAGGAACGAC GCATTGGCAA AATCATCAAC TTTGGTGTGA TTTACGGCAT GGGTGCCCAG
CGCTTTGCCC GCGAAACTGG CAGCAGCACC AAGGAAGCCC AAGGCTTTAT CGATCGCTTC
TACGATCGCT ATCCTCGGGT GTTTACCTAC CTGCAAAGCC TGGAACGCCA AGCGATCGCC
CGCGGTTATG TGGAAACAGT CTTGGGGCGG CGGCGTTACT TTGACTTTGA GGACACTGGC
CTCCAGAAGC TACGCGGGAG CGATCCCGAG AGCATCGATC TCGACAAGAT TCGTCCCAGC
CGCTTCGAGG CGCAATTGCT GCGAGCCGCC GCCAATGCGC CCATTCAGGG GTCAAGTGCC
GACATCATCA AAGTGGCGAT GGTGCAGTTG CAGGCGCTGT TGCAGTCCTA TCAAGCGCGG
ATGCTGTTGC AAGTCCATGA CGAACTCGTC CTAGAACTGC CGCCGGAGGA ATGGGACAGC
CTCGCACCCC AAATCCAGCA GACGATGGAG CAGGCAGTTC AGCTGACCGT GCCGCTGGCT
GTGGAACTGC ATGCAGGCCA CAACTGGATG GAGGCAAAGT AA
 
Protein sequence
MSVDSPLLLL VDGHSLAFRS YYAFAKGRDG GLRTRTGIPT SVCFGFLKAL LEVMEQQQPK 
ALAIAFDLGG PTFRHEADEN YKANRDEAPE DFKIDTDNLV ALLQTLNLPI LVEPGYEADD
LLGTVAQRGA EAGYRVRILS GDRDLFQLVD PEGAIRVLYL GNTFGRSANR EAAREIDPAA
VIDKLGVPPE QVIDFKALCG DSSDNIPGVK GIGPKTAVDL LQAWGDLDRI YDNLEAIKPA
VRKKLESDRE AAYHSRKLAQ IVTDIPLAID WDHYALTGFD EQEVLPWLEK LELQAFRRQV
DRLQQLFGGQ PPTVEALSDE SLDFWTAEET AAQQPRWPQL QPQIITTAAA LTDLVTLLEQ
RDSPEAIVAW DTETTDLDPR LAQLVGIGCA WGEEPDQLAY IPLGHEEGEQ LPLQTVLTAL
RPILESDRHP KALQNAKFDR LILRHQGIEL AGVVFDTMLA SYLLNPSLGH SLDALADRWL
KLQTRSYSDL VPKGKTIAQV AIAAVAQYCG SDVHVVQRLI PLLKAGIAES PALQFLLETV
ELPLEAVLAE MEDRGIRIDE GYLAELSEHL KGELDRLEGA AHTLAGDRFN LGSPKQLSEL
LFEKLGLNVK KSRKTKTGYS TDAAVLEKLQ GDHPIIDLIL EHRTLAKLKS TYVDALPSLV
AADGRIHTDF NQAVTATGRL SSSNPNLQNI PIRTEFSRQI RKAFLPREGW LLAAADYSQI
ELRILAHLSQ EPVLLEAYRQ GDDVHRLTAS LLFDREEITS EERRIGKIIN FGVIYGMGAQ
RFARETGSST KEAQGFIDRF YDRYPRVFTY LQSLERQAIA RGYVETVLGR RRYFDFEDTG
LQKLRGSDPE SIDLDKIRPS RFEAQLLRAA ANAPIQGSSA DIIKVAMVQL QALLQSYQAR
MLLQVHDELV LELPPEEWDS LAPQIQQTME QAVQLTVPLA VELHAGHNWM EAK