Gene Syncc9605_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1966 
Symbol 
ID3735658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1785070 
End bp1788030 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content64% 
IMG OID637776551 
ProductDNA polymerase I 
Protein accessionYP_382262 
Protein GI78213483 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.378529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGG CCACCGCCAA GCCACTTCTG CTGCTTGTGG ATGGCCATTC CCTGGCATTC 
CGCAGCTTTT ACGCCTTCAG CAAGGGTGGA GAAGGGGGTC TGGCCACCAA AGACGGCCGG
CCCACCAGCG TGACCTACGG CTTTCTCAAA GCCCTGCTGG ACAACAGCAA AACCCTGCAG
CCAGAAGGCG TTGCCATTGC CTTCGACACA GCCGAGCCCA CCTTTCGCCA CAAGGCAGAC
GCCAACTACA AGGCCCACCG GGATGTGGCC CCTGAGGTGT TCTTTCAGGA CCTTGAGCAG
CTGCAGCAGA TCCTCGAAAC GCACCTGCAG CTCCCTCTCT GCATGGCCCC CGGCTTCGAA
GCCGACGATG TGCTGGGAAC CCTGGCGAAC CGAGCGGCCG ACTCGGGATG GGGCGTACGA
ATCCTTTCGG GAGACCGCGA CCTTTTCCAG CTGGTGGATG ACAGCCGCGA CATCGCGGTG
CTTTACATGG GTGGAGGCCC CTACGCCAAA AGCAGTGGCC CAACACTGAT CCGCGAAGAA
GGGGTGCTCG GCAAGCTCGG CGTGATACCC GACAAGGTGG TGGACCTCAA GGCCCTCACC
GGTGACAGCT CCGACAACAT CCCCGGCGTG CGTGGTGTTG GCCCCAAAAC AGCAATCAAC
CTGCTGAAAG ACAACAGCGA TCTCGACGCG GTGTACGCCA CCCTTGAGGA AGTGGAAGCC
GAAGGGCCGA AAGCCAGCCG GGGCGCCATC AAAGGAGCTC TGAAAGAGAA GCTGCGCGCC
GACCGCGACA ACGCTTACCT CTCGCGCAAG CTGGCCGAGA TTCTCGTGGA TGTTCCCCTG
CCCAAGGAAC CCAGCCTGCC GTTGTCGTCG GTGGATGCTG ATGGCCTGAG CAGCTGCCTG
GAAGACCTCG AGCTCAACAG CCTGCTGCGC CAGGTGGGGG GCTTTGTGGC GGCCTTTTCC
GAAGGGGGCT ACGGGGCCAA TGCAGAGGCC GCTGCAGCCA AGCCCCCCCA CCGCTCAGCC
ACCACAGAAC CGGCAGCTGC CGAAACCGCA ACCGAACCGG TCACCAACGA CGACGTGGGG
GTGCCGGCCC TGAAACCGAA GCTGATCCAG ACCGAAACGG CCCTGGACGC CCTGGTGCAG
AGGCTGATGG CCTGCACCGA CAGCAGCCTG CCTGTGGCCT TCGACACCGA GACCACCGAC
CTCAACCCGT TCCGCGCCGA ACTGGTGGGC ATCGGCATCT GCTGGGGAGA GGACCTGGAC
GCACTTGCCT ACATCCCGCT GGGCCACAAA GGCAGCGAAG ACAGCATCCC GGAGCAGCTG
TCCTTGGAAA CCGTGCTCAC CGCCCTCGCC CCTTGGCTGG CCAGCAGCAA CCACCCCAAG
ACCCTTCAGA ACGCCAAGTA CGACCGCCTG ATCCTGCTGC GGCATGGCAT TGCCCTCGAG
GGGGTCGTGA TCGACACACT GCTGGCTGAT TACCTGCGGG ATGCCTCGGC CAAGCATGGT
CTGGAGCTCA TGGCGGAGCG GGAATTCGGC TTCCAGCCCA CCTCATTCAC CGATCTGGTG
GGCAAGAAAC AAACCTTCGC CGACGTGCCG CTGGAGCCCG CCAGCCTGTA TTGCGGCATG
GATGTGCACG TCACCCGGCG CCTCGCCCTG CTGCTGCGCC ATCAGCTGGA GACCATGGGC
CCGCAGCTGC TGCCGCTGCT GGAGCAGGTG GAGCAGCCAC TGGAACCAGT GCTGGCCCGG
ATGGAATCCA CCGGCATCCG CATTGATGTG CCCTATCTCC AGGGCCTTTC CGAAGAAATG
GGCTCCACCC TTCAACAGCT GGAGTCCGAC GCCAAAGCGG CCGCAGGAGT GGACTTCAAC
CTGGCCTCAC CCAAGCAACT CGGAGAGCTC CTGTTCGACA CCCTCGGCCT GGATCGCAAG
AAATCCCGGC GCACCAAAAC TGGCTTCAGC ACCGACGCCA CCGTGCTGGA GAAACTCGGC
AACGACCACC CGGTGGTGCC TCTGGTGCTG GACCACCGGG TGCTCAGCAA GCTCAAGAGC
ACGTACATCG ACGCCTTGCC GCAGCTGGTG GAGGCGGAAA CCGGGCGAGT CCACACCGAT
TTCAACCAGG CGGTTACGGC AACGGGTCGG CTGAGCAGCA GCAACCCCAA CCTGCAGAAC
ATTCCGGTGC GCACCGAATA CAGCCGTCGG ATCCGCAAGG CCTTCCTGCC TCAGGAAGGC
TGGACCCTGC TCAGCGCTGA TTACTCACAG ATCGAGCTGC GCATCCTCAC CCACCTCTCG
GGCGAAAAGG TGCTGCAGGA GGCCTACCGC GGCGGCGATG ACGTGCACGC GCTGACCGCC
AGGCTGCTGC TGGACAAGGA CGAGGTCAGT CCGGACGAGC GTCGGCTGGG CAAAACGATC
AACTTCGGTG TGATCTATGG CATGGGAGCC CAGCGCTTCG CCCGGGAAAC CGGCGTAAGC
CAAAGCGAAG CCAAGGAGTT CCTGGTCAAG TACAAGCAGC GCTACCCGAA GGTGTTCGCC
TTTCTCGAGC TGCAGGAGCG GCTCGCCCTC AGCCGCGGTT ACGTGGAAAC GATTCTCGGC
CGCCGGCGGC CGTTCCACTT CGATCGCAAC GGCCTGGGGC GGCTGCTGGG CAAGGACCCC
TTGGAGATCG ACCTGGATGT GGCCCGCCGC GGTGGCATGG AGGCGCAGCA ACTGCGGGCA
GCTGCCAATG CCCCGATTCA GGGGTCGAGT GCCGACATCA TCAAAGTGGC GATGGTGCAG
CTGCAAGACG CCCTGCTGCG TCAGGGCCTA CCGGCCCAGC TGCTGCTGCA GGTGCACGAC
GAACTAGTGC TCGAAGTGGC GCCGGACGCC CTCGAGACCA CCCGAGACCT GGTGGTGCGA
ACCATGGAAC AGGCCTTTGA ACTGAGTGTT CCCCTAGTGG TGGAGACCGG CGTCGGCGCG
AACTGGATGG AGGCGAAATA A
 
Protein sequence
MPEATAKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGR PTSVTYGFLK ALLDNSKTLQ 
PEGVAIAFDT AEPTFRHKAD ANYKAHRDVA PEVFFQDLEQ LQQILETHLQ LPLCMAPGFE
ADDVLGTLAN RAADSGWGVR ILSGDRDLFQ LVDDSRDIAV LYMGGGPYAK SSGPTLIREE
GVLGKLGVIP DKVVDLKALT GDSSDNIPGV RGVGPKTAIN LLKDNSDLDA VYATLEEVEA
EGPKASRGAI KGALKEKLRA DRDNAYLSRK LAEILVDVPL PKEPSLPLSS VDADGLSSCL
EDLELNSLLR QVGGFVAAFS EGGYGANAEA AAAKPPHRSA TTEPAAAETA TEPVTNDDVG
VPALKPKLIQ TETALDALVQ RLMACTDSSL PVAFDTETTD LNPFRAELVG IGICWGEDLD
ALAYIPLGHK GSEDSIPEQL SLETVLTALA PWLASSNHPK TLQNAKYDRL ILLRHGIALE
GVVIDTLLAD YLRDASAKHG LELMAEREFG FQPTSFTDLV GKKQTFADVP LEPASLYCGM
DVHVTRRLAL LLRHQLETMG PQLLPLLEQV EQPLEPVLAR MESTGIRIDV PYLQGLSEEM
GSTLQQLESD AKAAAGVDFN LASPKQLGEL LFDTLGLDRK KSRRTKTGFS TDATVLEKLG
NDHPVVPLVL DHRVLSKLKS TYIDALPQLV EAETGRVHTD FNQAVTATGR LSSSNPNLQN
IPVRTEYSRR IRKAFLPQEG WTLLSADYSQ IELRILTHLS GEKVLQEAYR GGDDVHALTA
RLLLDKDEVS PDERRLGKTI NFGVIYGMGA QRFARETGVS QSEAKEFLVK YKQRYPKVFA
FLELQERLAL SRGYVETILG RRRPFHFDRN GLGRLLGKDP LEIDLDVARR GGMEAQQLRA
AANAPIQGSS ADIIKVAMVQ LQDALLRQGL PAQLLLQVHD ELVLEVAPDA LETTRDLVVR
TMEQAFELSV PLVVETGVGA NWMEAK