Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3590 |
Symbol | |
ID | 7269734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4365004 |
End bp | 4367814 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643568398 |
Product | DNA polymerase I |
Protein accession | YP_002464864 |
Protein GI | 219850431 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000120816 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCGTC CGCTCTTGAT TTTGGTTGAC GGTCACGCAC TGGCATATCG CGCATTCTTT GCTTTGCGCG AGAGCGGCTT GCGCTCGTCG CGAGGTGAAC CGACGTATGC CGTCTTTGGG TTTGCTCAAA TCTTGTTAAC GGCACTTGCC GAGTACCGGC CCGATTATGT TGCGGTGGCG TTTGATGTTG GGCGAACGTT TCGCGATGAC CTATACGCCG AATACAAAGC CGGTCGCGCC GAGACGCCGG AAGAGTTTTA TCCGCAATTC GAGCGCATTA AACAGTTGGT GCAGGCGTTG TCAATCCCTA TCTATACCGC CGAGGGGTTT GAGGCCGATG ATGTCATCGG TAGCTTAGCT CGCCAGGCTA CCGAGCAGGG TGTTGATACG ATTATTCTTA CCGGCGATAC CGATACGCTA CAACTGGTGA ATGAGCACGT TCGGGTGGCG CTCGCCAATC CTTATGGCGG CAAGACGAGT ACCACTCTGT ACGATGTCGA ACAGGTGCGT AAACGTTACG ATGGCCTCGA GCCGGCGCAG TTGGCCGATC TGCGCGGTCT GAAGGGCGAT TCATCCGACA ATATCCCCGG TGTGCGCGGG ATTGGTGAGA AGGGAGCGAT TACCCTCCTC AAGCAGTTCG GTTCTCTCGA TAAGCTCCTG GATAACATTG AGGCAGCGCC CAAGCGCTAT CAACATCTCT TGCGTGAACA GGCCGACCAG GCGCGCTCGT CGCGGCATTT GGCGACGATC GTCACCGATG CGCCGGTGCA ACTCGATCTG GCAAAGTGTC GGCTTGGCGT GTACGATCGT GCGGCAGTAA TGGCTTTGCT CCAAGAGCTG GAGTTTGGGG TTAGCTCGAA CCTGATCAAA AAGCTGCCGT CGGTCGTGCA AGCGGCGACG GTTGCAACTT TACCGGCCGA TCTACCGACT GCACCACAAG GCTCGGTTCA GTTAGCGTTG TTTGCCAACG AGTCGGCATC GCCGACGATG GTTTCCTCGG TTACGTCGGC ACAGATTGTC CGCGATCCGC AAGCGCTGGC CGAGTTGGTA CAACGGTTGC GGGCTGCGCC GGGATTTGCA TTCGATACCG AATGTACGAG TCTGCAAGCC GTCGGTAGTC ATCTGGTAGG GATTGCGCTG GCTATCGCAC CTAACGATGC CTACTACGTA CCGGTTGGTC ACGAGGAGGG TGAGCAGTTG CCGTTGGCCG ACGTGGTGGC TGCACTTGGC CCGCTGTTTG CCGACCCCAA CATACCCAAA TTTGCCCACA ATGCGAAGTT CGATGCCGAG GTATTGGCCG GTGTCGGCAT ACAGGTGGCC GGTCTGGCGT TTGATACCAT GATCGCGGCG GCAATGTTAG GTAAACGACA AGGGTTGAAG GATCTGGCAT TCTACGAATT GAAACTGCCC GAACCACCGA CCACGATTGA AGATCTGATC GGGCGAGGTA GCAAGCAGAT CAGCTTTGCT GCGGTACCGA TCGAGCAAGC CGCTCCCTAC GCCGCCGCCG ATGCGCTGCA TACCTTACTA CTGACCGAAA CCTTGCGCGG GCAACTCACA ACTGACACCG CCCTCCGTGA TCTCTACTAT CGGGTCGAGC TACCGCTGAT CGACGTGCTA ACCGATATGG AGTTGACCGG TATCTTGCTC GATCACGAGT ATCTGCGCGA ACTGGGTAAA CGGTTTGCCC AACGTATCGC CGAGCTGACC GAACAGATTT ATGCGAAGGC CGGTGGGCCG TTTAATATCA ATTCCGGCCA ACAACTCAAC GAGGTCTTGT TTGAGCGACT GGGTATCAAT CCGCGTGATT ATGGGCTGAG TAAGCTCAAG AGTGGTGGTT ATTCGATTAC TGCCGAGGTG TTAGAGGAGC TAAGCCAACT CTACCCGATT GCCGCCGATA TTCTGGCTTA CCGTCAGCTT ACCAAGCTGA AGAGTACGTA TATTGACGCT CTGCCCCAAC TGGTGAATCC ACGTACCGGA CGCATCCATA CCTCGTACAA CCAGATCGGC GCTGCAACGG GTCGGCTGTC GTCGAATAAT CCTAACCTGC AAAATATTCC GGTGCGCACC GAAGAGGGAC GGGAAATCCG GCGCGCGTTC GTCGCTGCTC CGGGCCACCG TTTCGTCGCC GCCGACTACT CGCAGATTGA GTTGCGTGTG TTGGCCCACA TCAGTGGCGA CGAAAACCTG ATCGCCGCTT TTCAGCAAGG TCTTGATATT CACGCCGCTA CGGCCAGCCG ACTGTTTGGC GTAGCCCCTG ATCAGGTTGA CAAAAACCAG CGTCGTGTCG CCAAGACGGT GGTGTTTGGC GTTATTTACG GAATTAGCGC TTTTGGTCTT GCCCAACGGC TAGGTATCGA ACGCGATCTG GCGCGTCAAT TGATCGACAA CTTGTTCGAG CAGTTCCCCG GCATCCGCCG CTATATCGAT CAAACGCTCG CATTTGGCCG GCAACACGGG TATGTGCAAA CGTTGTTTGG CCGGCGGCGA GTGATGGAAG ATTTGCGGGC GAGTGGAGCA CGACGGGCGG CTGCCGAGCG CGAGGCGATA AACGCACCGA TACAGGGCAC TGCCGCCGAC ATCATGAAAA TGGCGATGGT CTATGTCCAT CGCGCTTTAC GCGAACGCGG TCTCCGCACT CGTTTGCTCT TGCAGGTGCA TGATGAGCTG ATCGCCGAAG CGCCGGAGGA AGAGGTTCCA GCGGCAGCTC ATCTGTTGCG TGAGGTGATG AGTAATACCT ACCAATTGGT TGTGCCGCTC GGCGTCAATC TCGAAACCGG GCCTAATTGG GAAGAGATGG CGGCGGTGTG A
|
Protein sequence | MARPLLILVD GHALAYRAFF ALRESGLRSS RGEPTYAVFG FAQILLTALA EYRPDYVAVA FDVGRTFRDD LYAEYKAGRA ETPEEFYPQF ERIKQLVQAL SIPIYTAEGF EADDVIGSLA RQATEQGVDT IILTGDTDTL QLVNEHVRVA LANPYGGKTS TTLYDVEQVR KRYDGLEPAQ LADLRGLKGD SSDNIPGVRG IGEKGAITLL KQFGSLDKLL DNIEAAPKRY QHLLREQADQ ARSSRHLATI VTDAPVQLDL AKCRLGVYDR AAVMALLQEL EFGVSSNLIK KLPSVVQAAT VATLPADLPT APQGSVQLAL FANESASPTM VSSVTSAQIV RDPQALAELV QRLRAAPGFA FDTECTSLQA VGSHLVGIAL AIAPNDAYYV PVGHEEGEQL PLADVVAALG PLFADPNIPK FAHNAKFDAE VLAGVGIQVA GLAFDTMIAA AMLGKRQGLK DLAFYELKLP EPPTTIEDLI GRGSKQISFA AVPIEQAAPY AAADALHTLL LTETLRGQLT TDTALRDLYY RVELPLIDVL TDMELTGILL DHEYLRELGK RFAQRIAELT EQIYAKAGGP FNINSGQQLN EVLFERLGIN PRDYGLSKLK SGGYSITAEV LEELSQLYPI AADILAYRQL TKLKSTYIDA LPQLVNPRTG RIHTSYNQIG AATGRLSSNN PNLQNIPVRT EEGREIRRAF VAAPGHRFVA ADYSQIELRV LAHISGDENL IAAFQQGLDI HAATASRLFG VAPDQVDKNQ RRVAKTVVFG VIYGISAFGL AQRLGIERDL ARQLIDNLFE QFPGIRRYID QTLAFGRQHG YVQTLFGRRR VMEDLRASGA RRAAAEREAI NAPIQGTAAD IMKMAMVYVH RALRERGLRT RLLLQVHDEL IAEAPEEEVP AAAHLLREVM SNTYQLVVPL GVNLETGPNW EEMAAV
|
| |