Gene Syncc9902_0693 details


Gene Information

Locus tag: Syncc9902_0693
Symbol:
ID: 3743976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 701399
End bp: 704395
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 60%
IMG OID: 637770865
Product: DNA polymerase I
Protein accession: YP_376705
Protein GI: 78184270
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
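
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(701399, 704395)
print(length, protein_length_aa(length))
```

Under these assumptions the script prints 2997 and 998, matching the Gene Length and Protein Length fields.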


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.433628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAGG CCGTCACCAA GCCCCTACTC CTGCTTGTGG ATGGCCACTC ACTGGCTTTT 
CGCAGTTTTT ACGCCTTCAG CAAAGGCGGC GAAGGGGGCC TTGCCACAAA AGATGGACGG
CCAACGAGTG TCACTTATGG CTTTTTGAAG TCGCTCCTGG ACACCGGCAA AACGTTCAAG
CCCCAGGGCG TCGCGATTGC CTTCGACACC GCGGAACCCA CATTCCGCCA CAAAGCGGAC
ACGAATTACA AAGCCCACCG GGACGTGGCA CCAGAGGTTT TCTTCCAAGA CCTCGAACAG
TTGCAGCAGA TCCTTAGCGA TCAAATGAAG TTGCCCCTGT GCATGGCACC GGGATACGAA
GCCGATGACG TGCTCGGCAC CCTTGCCAAC CGTGCCGCCG ACGCCGGCTG GGGGGTTCGG
ATTCTCTCGG GAGACCGTGA TCTTTTTCAG CTCGTCGACA ACAGCCGCGA CATCGCGGTG
CTGTACATGG GTGGCGGGCC CTATGCCAAG GGAAGCGGGC CAACGCTGAT TCAAGAGGAC
GGCGTTGTGA GCAAGTTGGG AGTCATGCCC GACAAGGTGG TGGACCTCAA GGCCCTCACA
GGAGACAGCT CCGACAACAT CCCAGGGGTG CGGGGCGTTG GTCCCAAAAC CGCCATCAAC
CTGCTGAAAG ACAACATCGA CCTGGACGCG GTGTACGCGA CCTTGGCGGA GGTGGAAGCT
GAAGGTCCGA AGGCCAGTCG CGGGGCCATC AAAGGGGCCT TGGTCGGCAA GCTGCGCTCC
GATCGCGACA ACGCCTATCT GTCGAGAATG CTGGCGGAAA TCCTCGTGGA CATTCCCCTC
CCGAAGGATC CAGAACTCGG CCTACAGCAG GTTGATGCTG AAGGGTTGAG CGATCGCCTC
GAAGACCTCG AGCTCAATAG CCTCGTCCGT CAAGTCGGTG GGTTTGTGGC GACCTTTTCC
ACCGGTGGGT ATGGCGCCAA TGCCAGTAGC GAAGCTGAGA AGACCCCTAA ACGCTCCCCA
GCGAAAACGA ACCCTGATTC CACCAGTTCA ATCGGGAGTG CAGCCGCTCC GCAGCAAGGT
TCAGGGGAAG CGGAGGTTGG AGCAATCCCT CCCCTCCAAC CGCAGCTGAT CGAGAGCAAC
GAAGCCCTGC AAGCCCTTGT GCAACGGCTG ATGGACTGCA CCGATCCCAA AGCGCCAATT
GCCGTCGACA CCGAAACCAC AGATCTCAAC CCATTCAAGG CCGAACTCGT GGGGGTAGGG
GTGTGCTGGG GCGAGGCCAA CGATGCCCTC GCCTACATCC CCATCGGCCA CAAGCCCCCC
AGCGAACTCT CCGAGGCAAC ACCACCCCAG CAGCTTCCGC TGGAAACGGT GCTAATGGCG
CTCTCGCCCT GGCTTGCGAG TCCGCAGCAC CCCAAAGCCC TACAGAACGC CAAATACGAC
CGACTCATCC TGCTACGCCA TGGGCTCACG CTCAACGGCG TCGTGATCGA CACCCTGCTG
GCGGATTACC TGCGGGATGC CGCGGCCAAG CACGGTTTGG ACCTGATGAG CGAACGGGAG
TTTGGCTTCC GCCCCACCAC CTACGGCGAT CTTGTTGGCA AGAAACAAAC CTTTGCCGAT
GTGGCAATCG AACCTGCCAG CCTCTATTGC GGCATGGACG TGCACGTCAC CAGGCGATTG
GCCCTGCAGC TCCGCCAAAC CTTGCAAGCG ATGGGCCCCC AATTACTGCC TCTCCTGGAG
GGCGTTGAAC AACCGCTGGA ACCCGTTCTG GCCCAAATGG AGGCCACCGG CATCCGCATC
GATGTGCCTT ACCTCAAAAC ACTGTCGGAC GAATTAGGCA GCACCCTGAA CCGGTTAGAA
ACCGACGCCA AACAGGTCGC TGAGGTGGAC TTCAACCTCG CCTCTCCAAA GCAACTTGGG
GAATTGTTGT TCGACACGCT CGGACTGGAT CGCAAGAAGT CGAGGCGCAC GAAAACCGGA
TTCAGCACTG ATGCCACCGT TCTCGAAAAA CTCGAAAACG ACCATCCCGT CGTTCCTCTC
GTGCTGGAGC ACCGCGTCTT GAGCAAGCTC AAGAGCACCT ACGTTGATGC TCTGCCCCAA
CTCGTCGAAG CGGAAACCGG CCGCGTCCAC ACCGACTTCA ACCAAGCCGT AACAGCGACG
GGCCGCTTGA GCAGCAGCAA CCCAAATCTG CAAAACATTC CCGTTCGCAC GGAATACAGC
CGTCGCATCC GCAAAGCGTT CCTCCCCCAA GAGGGCTGGA CACTGCTCAG CGCCGACTAC
TCCCAAATCG AACTCCGAAT CCTCACCCAC CTCTCCGGGG AAGAGGTGCT GCAGGAGGCC
TACAGCACGG GCGACGACGT GCACGCACTC ACCGCACGCT TACTGCTGGA TAAAGACGAC
GTGAGCGCCG ATGAACGTCG CCTCGGAAAA ACGATCAACT TCGGGGTGAT TTACGGCATG
GGCGCCCAAC GCTTTGCGCG GGAAACAGGG GTGAGCTCGG CCGAAGCGAA GGAGTTCCTC
ACCAAATACA AACAGCGCTA CCCCAAAGTG TTTGCCTTCC TTGAGCTTCA GGAGCGGCTC
GCCCTAAGCC GCGGCTACGT GGAAACAATC TTGGGTCGTC GTCGCCCATT TCATTTCGAT
CGCAACGGCC TCGGCCGCTT ATTGGGAAAA GATCCCCTCG AAATTGATCT CGATGTGGCA
CGGCGAGGTG GGATGGAAGC ACAACAACTG CGCGCCGCCG CCAACGCCCC CATTCAGGGC
TCCAGCGCCG ACATCATCAA GGTGGCGATG GTGCAATTAC AAGCGGTGCT TCTTAGCCAA
GGGATCCCCG CCCGCCTACT CCTGCAGGTG CATGACGAAC TGGTCCTCGA AGTGGCGCCA
GACGCATTGG ACACCACGCG AAACCTTGTG GTGAACACCA TGGAAAACGC CGTCAAGCTC
AGCGTGCCTC TCGTGGTGGA AACCGGCGTT GGTCGCGACT GGATGGAAGC GAAATAA
 
Protein sequence
MPEAVTKPLL LLVDGHSLAF RSFYAFSKGG EGGLATKDGR PTSVTYGFLK SLLDTGKTFK 
PQGVAIAFDT AEPTFRHKAD TNYKAHRDVA PEVFFQDLEQ LQQILSDQMK LPLCMAPGYE
ADDVLGTLAN RAADAGWGVR ILSGDRDLFQ LVDNSRDIAV LYMGGGPYAK GSGPTLIQED
GVVSKLGVMP DKVVDLKALT GDSSDNIPGV RGVGPKTAIN LLKDNIDLDA VYATLAEVEA
EGPKASRGAI KGALVGKLRS DRDNAYLSRM LAEILVDIPL PKDPELGLQQ VDAEGLSDRL
EDLELNSLVR QVGGFVATFS TGGYGANASS EAEKTPKRSP AKTNPDSTSS IGSAAAPQQG
SGEAEVGAIP PLQPQLIESN EALQALVQRL MDCTDPKAPI AVDTETTDLN PFKAELVGVG
VCWGEANDAL AYIPIGHKPP SELSEATPPQ QLPLETVLMA LSPWLASPQH PKALQNAKYD
RLILLRHGLT LNGVVIDTLL ADYLRDAAAK HGLDLMSERE FGFRPTTYGD LVGKKQTFAD
VAIEPASLYC GMDVHVTRRL ALQLRQTLQA MGPQLLPLLE GVEQPLEPVL AQMEATGIRI
DVPYLKTLSD ELGSTLNRLE TDAKQVAEVD FNLASPKQLG ELLFDTLGLD RKKSRRTKTG
FSTDATVLEK LENDHPVVPL VLEHRVLSKL KSTYVDALPQ LVEAETGRVH TDFNQAVTAT
GRLSSSNPNL QNIPVRTEYS RRIRKAFLPQ EGWTLLSADY SQIELRILTH LSGEEVLQEA
YSTGDDVHAL TARLLLDKDD VSADERRLGK TINFGVIYGM GAQRFARETG VSSAEAKEFL
TKYKQRYPKV FAFLELQERL ALSRGYVETI LGRRRPFHFD RNGLGRLLGK DPLEIDLDVA
RRGGMEAQQL RAAANAPIQG SSADIIKVAM VQLQAVLLSQ GIPARLLLQV HDELVLEVAP
DALDTTRNLV VNTMENAVKL SVPLVVETGV GRDWMEAK
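
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and check that the nucleotide sequence translates to the listed protein. It is an assumption-laden illustration rather than the database's own method: the codon table is written out by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons), and the helper names `clean` and `translate` are hypothetical. Only the first line of the gene sequence is used as input here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCTGAGG CCGTCACCAA GCCCCTACTC CTGCTTGTGG ATGGCCACTC ACTGGCTTTT"
print(translate(first_line))
```

The printed residues can be compared with the start of the protein sequence once its spaces are removed with `clean`. Because this gene starts with ATG, the start-codon differences of table 11 do not matter for this record.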