Gene Information |
Locus tag | Cyan7425_0850 |
Symbol | |
ID | 7286766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7425 |
Kingdom | Bacteria |
Replicon accession | NC_011884 |
Strand | + |
Start bp | 670468 |
End bp | 673416 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643583859 |
Product | DNA polymerase I |
Protein accession | YP_002481597 |
Protein GI | 220906286 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
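
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 670468
end_bp = 673416
reported_gene_length = 2949     # bp
reported_protein_length = 982   # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 2949 0 982
print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length (2949 bp) and protein length (982 aa) agree with the start and end coordinates.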
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0300789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAGCC CTGCTGCGAT CGTTCAACCC AACCTTGTCC CAGAGACAAA TAAAACCCTT CTCCTGGTGG ATGGGCATTC CCTCGCTTTC CGTTCCTACT ATGCCTTTGC GCGGGGTCGG GAGGGCGGTT TACGCACCTC TACAGGAATT CCCACCAGCG TTTGCTATGG ATTTCTTAAA TCTCTGCTGG AAGTAATGCA AATTGAGCAG CCGGGGTACC TGGCGATCGC CTTTGACCTG GGGGTACCAA CGTTTCGCCA CGAAGCAGTG GAAAGCTACA AAGATGGGCG ACCGGAGACC CCAGAAGACT TTAAACCTGA CTTAGAAAAC CTGAAAGAGT TGTTGGCGGC CCTGAACCTG CCCGTTCTCA CCCTGGCAAC CTACGAAGCG GATGACATTA TTGGGACTTT AGCGCGAAAA GCCAGTCAGT CCAACTATCG GGTCAAAATT CTCAGTGGTG ACCAGGATCT CTTTCAACTG ATCAACCCGG AACAAGGCGT GACGGTCTTA CACCTGAGCA GCACCTTCGC CCCCCGCACA CCCCTCGGTA CTGGCCATCC CCGCGAATTT GGTACGGAGC AGGTGAAAGA AAAACTGGGG ATTCTGCCCA GCCAGGTGGT GGACTACAAA GCCCTCTGTG GCGATAGTTC AGATAATATT CCTGGTGTCA GGGGAATTGG CCCCAAAACC GCGGTGCAAT TGCTCAGCCA GTTTGGCTCC CTGGAGCAAA TTTATCAATC CCTCGACCAG ATCAAGGGAG CCACCCGCCA AAAGCTGGAG TCTGGTCATG CCTCCGCCCT GCAATCTCAG TACTTAGCTC AAATTCATCT GGATGTTCCC CTGGAGGCTA CCCCGGACCA CCTGGAACTG CGCGGCTTTG AAACCTCAAA GGTCATACCC CTGTTAGAAA AACTGGAATT TCAATCCTTT ATGGGTCAAA TTTCCCGTTT ACAGCAACGT TTTGGGGGTG GGGCCATTGC TGCAGCCAGT GAGGGGGCAG CGGGTAGTGA TCTAGCGGAC AACGATCCTG ATCTCTGGTT TTTCAGTGCG GCAGATACGG CAGCGGCCCA GGCCCAACCT GCAGTTCTGA TTTCACCTCA AATTATTGAT ACCCCTGAAA AGTTAGCGGC TCTGGTCGAA ACCCTGCAAA CCTGTACTGA TCCCGGAACC CCTGTGGCCT GGGATACGGA AACCACAGCC CTTTCCCCCA GGGATGCTCA GCTTGTTGGC ATTGGTTGCT GCTGGGGCCC GGCACCGGAT CAAATTGCTT ATCTCCCCCT CGGTCACAGT CAAGGTGGCA ATCTGGATCT GACCCTGGCG CTGGAACAAT TGCGTCCCCT GCTGGAAGAT GCCCGCTATC CTAAAGCGTT TCAGAATACC AAGTACGATC GCCTGGTGCT GAAATTTCAG GGGGTTCATC TGCAGGGTGT GGTGTTCGAT ACGATGCTGG CCAGCTATGT GCTCAACCCT GAAGGCAGCC ATAACCTGAC GGATCTCTCC CGTAAATACC TGGACATTAC GGCTCAGAGT TATACCGACC TGGTCGAAAA GGGGCAGAAT TTAGCTGATC TGAGTGTGGA GGCGATCGCT AACTACTGTG GGCTGGATGT CTATGCCACA TTCCAACTGG TGCCTAAACT GCGGGCCGAC CTGCTCGATG CCAATCCCAA ACTGCTGGAC CTGCTTTTAA CCGTTGAGCA ACCCCTGGAA CCCGTGCTGG CGATCATGGA AACCTGCGGG ATTCGGATCG ACCAGGACTA CCTCAAACAG TTATCCCAGC AACTAGAACA GGATCTGCAC 
GCCCTGGAAG AACGGGCCTA TGGGTTGGCG GGAGAAACCT TTAACCTGGG CTCCCCCAAA CAGTTAAGTG AGCTGCTGTT TAATAAGCTG AAGCTGGATG TGAAAAAGTC CCGCAAAACC AAGCTGGGGT ACTCCACGGA TGCCGCCACG CTGGAAAAAA TGCAGGGAGA TCATCCGGTG ATTGATCTGA TCCTGGAACA CCGCACTCTG GCCAAGCTGA AATCAACCTA TGTCGATGCC CTGCCCAATC TGGTTCGCAG CGATACGGGT CGGGTGCATA CGGATTTTAA CCAGGCGATT ACGGCGACGG GACGGCTGTC TTCCTCCAAT CCAAACTTGC AAAATATTCC GATTCGGACT GAGTTTAGCC GCCAGATCCG TAAAGCCTTT ATCCCGGAAC CGGGCTGGAT GCTCGTCACC GCAGACTACT CCCAGATTGA ATTGCGCATT CTGGCCCACC TCAGTCAGGA ACCCCGTTTA GTGGAAGCCT ACCAGCAGCA TCAGGATGTC CACAAACTGA CCGCCCAATT GCTCCTGGAA AAAGAGGACA TCAACCCGGA GGAACGGCGT CTAGCTAAAA TCATCAACTT TGGCGTCATC TATGGCATGG GGGCACAACG GTTTGCCCGC GAGTCTGGCG TTAGTTTTAA GCAGGCCAAG GGGTTTATCG ATCGCTTCTA TGAACGCTAT CCCCTGGTGT TCCGCTATCT AAAGCAGATG GAAGGGCAGG CGATCGCCCT AGGCTACGTC GAAACCCTGC TGGGGCGGCG GCGCTACTTC AGTTTTGATA GCCGAGAGTT ACAACAATTG CGGGGTAAAG ATCCCGATAG TCTGGCAGAG CTGGATTTAA GTCAACTGAA AGTCAGTCAG TACGATCGTG GTCAGTTGCG GGCCGCTGCT AATGCCCCGA TTCAGGGTTC CAGTGCAGAC ATCATCAAAG TGGCGATGAT CAAGCTACAG CAACGACTCC AACCCTATCA GACCCGCCTG CTGTTGCAAG TTCACGATGA ACTCGTGTTT GAAATGCCCC CCTCCGAATG GGAACCGTTG CGATCGTTGA TCCAGGCCAC CATGGCGGAG GCGGTTCCCC TCTCCATTCC CCTGCTGGCG GAGATTCATG CTGGACCGAA CTGGATGGAA GCGAAATAA
|
Protein sequence | MTSPAAIVQP NLVPETNKTL LLVDGHSLAF RSYYAFARGR EGGLRTSTGI PTSVCYGFLK SLLEVMQIEQ PGYLAIAFDL GVPTFRHEAV ESYKDGRPET PEDFKPDLEN LKELLAALNL PVLTLATYEA DDIIGTLARK ASQSNYRVKI LSGDQDLFQL INPEQGVTVL HLSSTFAPRT PLGTGHPREF GTEQVKEKLG ILPSQVVDYK ALCGDSSDNI PGVRGIGPKT AVQLLSQFGS LEQIYQSLDQ IKGATRQKLE SGHASALQSQ YLAQIHLDVP LEATPDHLEL RGFETSKVIP LLEKLEFQSF MGQISRLQQR FGGGAIAAAS EGAAGSDLAD NDPDLWFFSA ADTAAAQAQP AVLISPQIID TPEKLAALVE TLQTCTDPGT PVAWDTETTA LSPRDAQLVG IGCCWGPAPD QIAYLPLGHS QGGNLDLTLA LEQLRPLLED ARYPKAFQNT KYDRLVLKFQ GVHLQGVVFD TMLASYVLNP EGSHNLTDLS RKYLDITAQS YTDLVEKGQN LADLSVEAIA NYCGLDVYAT FQLVPKLRAD LLDANPKLLD LLLTVEQPLE PVLAIMETCG IRIDQDYLKQ LSQQLEQDLH ALEERAYGLA GETFNLGSPK QLSELLFNKL KLDVKKSRKT KLGYSTDAAT LEKMQGDHPV IDLILEHRTL AKLKSTYVDA LPNLVRSDTG RVHTDFNQAI TATGRLSSSN PNLQNIPIRT EFSRQIRKAF IPEPGWMLVT ADYSQIELRI LAHLSQEPRL VEAYQQHQDV HKLTAQLLLE KEDINPEERR LAKIINFGVI YGMGAQRFAR ESGVSFKQAK GFIDRFYERY PLVFRYLKQM EGQAIALGYV ETLLGRRRYF SFDSRELQQL RGKDPDSLAE LDLSQLKVSQ YDRGQLRAAA NAPIQGSSAD IIKVAMIKLQ QRLQPYQTRL LLQVHDELVF EMPPSEWEPL RSLIQATMAE AVPLSIPLLA EIHAGPNWME AK
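
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned and spot-checked against the protein sequence. The helper names are hypothetical, and the codon map is deliberately partial: it covers only the codons in the first three blocks of the gene sequence, not the whole of translation table 11.

```python
# Partial codon map (assumption: only the codons needed for the
# first 30 nucleotides of the gene sequence are included).
PARTIAL_CODON_MAP = {
    "ATG": "M", "ACT": "T", "AGC": "S", "CCT": "P", "GCT": "A",
    "GCG": "A", "ATC": "I", "GTT": "V", "CAA": "Q", "CCC": "P",
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate_fragment(seq: str) -> str:
    """Translate complete codons using the partial map above."""
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return "".join(PARTIAL_CODON_MAP[codon] for codon in codons)


# First three blocks and last block, copied from the gene sequence field.
first_blocks = "ATGACTAGCC CTGCTGCGAT CGTTCAACCC"
last_block = "GCGAAATAA"

fragment = clean_sequence(first_blocks)
print(fragment[:3])                     # ATG (start codon)
print(translate_fragment(fragment))     # MTSPAAIVQP
print(clean_sequence(last_block)[-3:])  # TAA (stop codon)
```

The translated fragment matches the first block of the protein sequence (MTSPAAIVQP). Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the reported GC content of 54%. A full translation would need a complete codon table for translation table 11.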