Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0179 |
Symbol | |
ID | 3747705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 200023 |
End bp | 202797 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637772706 |
Product | DNA polymerase A |
Protein accession | YP_378500 |
Protein GI | 78188162 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATGC TTTACCGTGC TTTTTTTGCG TTGCAGCGCA CAGGCATGAG TAGCCCTTCG GGGTTGCCAA CGGGTGCGCT CTACGGCTTT ACCACAGCGT TGCTTAAAAT TTTTGAGAAT TATCATCCTC ACTACTTAGT TGCGGCATTT GATAGCCGCG AAAAAACCTT TCGCCACCAT TTGCTTGAGA GCTATAAGGC AAATCGTGCA GCTCCACCTG AAGAGCTGTT ACAGCAACTT GAAAAGTTGT TTGAGTTGTT GAAAGCTTTT GGAGTGCCTG TTATTAAGCA AGCGGGTTAT GAAGCTGATG ATCTTATTGG CGCGATGGTT ACTCAGTTTG CGGATGTTTG CCGCATTGGC ATTGTTACGC CCGATAAAGA TTTAGCGCAG CTTGTGCGCG AAGGTGTGCA AATTTTAAAG CCGGGGAAAA ATCAGCATGA GTTAGAGCCG CTTGGTTGCA ATGAGGTGAA AGCTCACTTT GGCGTTCCTC CCAAACAATT CACCAATTTT TTAACCTTAA CCGGTGATAC GTCGGATAAC ATTGTGGGCG CTAAAGGCAT TGGTCCAAAA ACCGCCGCAA CCTTGCTTGA AAAATATCAA ACCTTAGATA AGCTTTACCA ACACTTGGAT GAGTTAACGC CAAAGGTGCG GAAAAGCCTT GAGGATTTTG CACCGAATCG GGAGTTGGTG CTGCAACTTG TTACCATTTG CTGCGATGCG CCGCTCCATG TTACGTTAGA GGAACTTGCT TGCAAAAATC CCGCACGAGA TGTTGTGCTG CCGCTCTTGC AAGAGTTGGG CTTCCGTACC ATTGCTGCTC GTTTACAAGC TGCGTCCGTG GCACTTACAT GCGCTTGTAA TGATGGGGGG GAAAGTGCTC CACCAATGCA AAGCGATCCT AATAGTTCCA ACCTTTTAAA CGGAAGTGAT GGCAATACTT CGGCAACCGA TACCGCTCCC CCACCATCAT TCCCAGACGT TCCTCGCCAT TACACCCTTG TAGAAACAAG AGAGCAATTG CAGGCGTTGC TTGAGGAGTT GCAACAGGTT ACGCATATAG CGGTTGATAC CGAAACCACA AGCCTTGATG TTTTTGAAGC TGAGCTGGCA GGAATTTCGC TTTGTGCTGA AGCGGGTAAA GCATTTTTTA TTGCCACTAC GCCCGATGCT CTTGAGAGAA AAGAGGTTGT CAAGCAACTC AAACCACTGC TTGAAAATCC CGCAATTACG AAAAGCGGGC AGAATTTGAA GTACGATATG CTGGTGCTGA AAAAGTATGG CATTGAACTT GCACCCATCA GCTTTGATAC CATGCTTGCA AGTTATGTGC TTAACCCCGA TGAGCACCAC AATCTCGACG ACATGGCACT GCGTTACCTT GGGCGCACCA CCACCAAGTA TGATGAGCTT ACGGGCACAG GCAAACAGCG CCGCCATATT TTTGAGGTGG AAAAAGAGGC ACTCACCAAC TACGCCTGCC AAGATGCCGA TGTGGCTTTT CAACTGGAAG AGGTGCTGCA AGCCCAACTG CAAGCCGAGC CGCAACTGCT GGCACTTTGC ACCACTATGG AGTTCCCGCT TGTGCGCGTG TTAGCAACAA TGGAGTATGC TGGTATTGCT ATTGATACCG AGCATCTTGC CCGTGTAGCC GAAACCACCG AGCTGGAACT TCAATCCTTA ACAGACAACA TTTACGCGGC GGCTGGTAGC TCTTTTAATA TTGATTCACC CAAACAGCTT TCGCACGTAC TCTTTACCGA TCTTAGCTTG CCAACAGGTA AATCCACCAA AACAGGCTTT TCAACCGATG TTGGCGTTTT GGAGGAGTTG GCTGCAACCT ACCCCATCGC AAGCGATTTA CTGAGCTACC GCACGTTGCA AAAGTTAAAA GGAACCTACA TTGAGGCGCT GCCAAAAATA ATCAATCCAC GCACAGGACG CATTCATACC TCCTTTAACC AGCACATTAC CGCAACGGGC AGGCTCTCAT CCTCAAATCC CAACCTGCAA AACATTCCCG TTCGCACGGC GCTTGGTAAG GAAATTCGCC GCGCCTTTAT TCCTTCAACC CCCGAACATT GGCTGCTTTC GGCTGATTAC TCGCAAATTG AGCTGCGCAT TGCCGCTGAG CTTTCGGGCG ATGAGCGCTT GATTGCTGCT TTCCGCAACG GCGAGGATAT TCACACCGCA ACGGCACAAG TGATTTTTGG AACGGAGGAA ATTAGTAGCG ATATGCGCCG CAAAGCTAAA GAGGTGAACT TTGGCGTGCT CTACGGCATT CAGCCTTTTG GGTTAGCAAA GCGCTTGAAC ATTCCCCAAA AAGAGGCAAA AGTTATTATT GAAACCTATA AAGCTAAATA TCCACAGCTC TTTAATGTGT TGCGCCATAT TATTGAGGAG GGAAAAGAAA AAGGCTACGT TACCACCCTT TTGGGGCGAC GACGCTACAT TGCTGACCTT AACAGCCGCA ATGGCACCGT ACAAAAAGCT GCCGAACGCG CCGCTATGAA TACGCCCATT CAAGGTACAG CGGCAGATAT TATTAAGTGC GCTATGAACC TTTGTTATCA GCAAATGCAA GCGTCAGGCA TGGCTTCCGA AATGCTCTTG CAAGTGCATG ATGAATTGCT TTTTGAAACC ACTGATAGCG AAAAAGAGGC ACTAACAAAG CTTGTAGAAA ATGCCATGAA AGAGGCTGCG GTGCTTTGCG GCATGAAGCA AGTGCCGGTG GAGGTTGATT GCGGAGTTGG AAAAAATTGG CTTGAAGCCC ATTGA
|
Protein sequence | MAMLYRAFFA LQRTGMSSPS GLPTGALYGF TTALLKIFEN YHPHYLVAAF DSREKTFRHH LLESYKANRA APPEELLQQL EKLFELLKAF GVPVIKQAGY EADDLIGAMV TQFADVCRIG IVTPDKDLAQ LVREGVQILK PGKNQHELEP LGCNEVKAHF GVPPKQFTNF LTLTGDTSDN IVGAKGIGPK TAATLLEKYQ TLDKLYQHLD ELTPKVRKSL EDFAPNRELV LQLVTICCDA PLHVTLEELA CKNPARDVVL PLLQELGFRT IAARLQAASV ALTCACNDGG ESAPPMQSDP NSSNLLNGSD GNTSATDTAP PPSFPDVPRH YTLVETREQL QALLEELQQV THIAVDTETT SLDVFEAELA GISLCAEAGK AFFIATTPDA LERKEVVKQL KPLLENPAIT KSGQNLKYDM LVLKKYGIEL APISFDTMLA SYVLNPDEHH NLDDMALRYL GRTTTKYDEL TGTGKQRRHI FEVEKEALTN YACQDADVAF QLEEVLQAQL QAEPQLLALC TTMEFPLVRV LATMEYAGIA IDTEHLARVA ETTELELQSL TDNIYAAAGS SFNIDSPKQL SHVLFTDLSL PTGKSTKTGF STDVGVLEEL AATYPIASDL LSYRTLQKLK GTYIEALPKI INPRTGRIHT SFNQHITATG RLSSSNPNLQ NIPVRTALGK EIRRAFIPST PEHWLLSADY SQIELRIAAE LSGDERLIAA FRNGEDIHTA TAQVIFGTEE ISSDMRRKAK EVNFGVLYGI QPFGLAKRLN IPQKEAKVII ETYKAKYPQL FNVLRHIIEE GKEKGYVTTL LGRRRYIADL NSRNGTVQKA AERAAMNTPI QGTAADIIKC AMNLCYQQMQ ASGMASEMLL QVHDELLFET TDSEKEALTK LVENAMKEAA VLCGMKQVPV EVDCGVGKNW LEAH
|
| |