Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4047 |
Symbol | |
ID | 4072469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4781910 |
End bp | 4785122 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986078 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_593121 |
Protein GI | 94971073 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC AAAACCAATA CATCGAGTTG CATGCGAACA GCGCGTTCAG TTTCCTGCGT GGGGCTTCAC TGCCTGAGAC GCTGATTTTT CGCGCGCAGC AACTCGAATA TCCGGCGATG GCGCTGATGG ATGGAAACGG CGTTTATGGC TCGGCGCGGT TTCATCTCGC TGCGAAGCCG AATGGCGTAC GTGCGCACGT GGGCAGCGAG ATTGCCGTGC GCGATCTCGG TGAGCGCATG GCGCCGGCGG CGTGGTTGCC GCATCAATGG CCGGCGGAAC CGGTGCGTCT ACCGCTGTTA TGCGAGTCGC GCGAGGGATA CCAGAACCTG TGCCGCATGG TGACGCGGCT GAAGATGCGC GAGCCGTCGA AGGCGGAAGG CGCGGCGGTG ATGGCCGATG TGGAAGAGTT CGCGCGCGGT CTGGTGTGTT TGACCGGCGG CGACGAGGGT CCACTGGCGT CGGCGCTGGC GCGTGGTGGC GAGCCCGAAG GACACAAAAC TGTAGAGCAC CTGGTGCGCA TGTTTGGGCG CGAGAATGTT TTCGTCGAAT TGCAGCGGCA TGGATTGCGT GAGCAGGAGT GGCGCAACCA GGCGGCGGTG CGGATTGCGC GCTCGCTGCA GCTGCCGATT CTCGCCACCA ACGGCGTGCG TTATGCCGAT GAGCCGGAGC GCGAGATTGC CGATCTGTTT ACGGCGATCC GCCATCACGT GGCACTGAAG GATGCGGGGC GTCTGCTGGC GCAAAATTCG TGCCGGTATC TGCGTCCGAC GGCGGAGATG GCGCGGTTGT TTCGCGACCT GCCGGAGGCG GTTGCGAATA CGTGGGAGTT GTCGCAGCGG CTGACGTTCG AGCTCGACGA TCTTGGTTAC CAGTTCCCGA TTTATCCCAC GCCGGATGGC GAATCGATGG ACTCGTTCCT GGAGAAACGC GTACAGGAAG GCGTGATCAA GCGCTATGGA GCGAAGAACG AGCCCGATCT CTACGCTCGT GCACAGAAGC AAGTTGCGCG CGAATTGGCG CTGATCAAGA AGCTCGGTCT CGCCGGATAT TTTCTGATCG TATGGGACAT CATTCGCTTC TGCCAGCAGA ACGACATTCT CGTGCAAGGG CGCGGCAGCG CCGCGAATTC CGCGGTTTGT TACGCGTTGG AGATCACGGC GGTTGACCCG GTTGGCATGG AGTTGCTGTT TGAGCGCTTT TTGAGCGAAG CGCGCGGCGA GTGGCCGGAT ATTGACCTCG ATTTGCCATC GGGCGACAAG CGCGAGCAGG CGATCCAGTA CGTGTACACG CGTTACGGGC ATCTAGGCGC GGCGATGACT GCAAATGTCA TTACCTATCG TGGAAAATCC GCGGCGCGAG AGGTTGGCAA GGCGCTTGGT TTCGACGTCG AGACGTTGAA CAAGCTGACA AAGCTGGTGA GCACGTGGGA GTGGCGCGGG CCGAATGACA CGCTGGAAAA CCAATTCGAA TCGGCGGGAT TTGAAGCGAA ACATCCGCAT ATCGCGAAAT ATCTCGACCT TTGCAGCCGT ATCCAGGATT TGCCGCGGCA TCTCGGGCAA CACTCCGGCG GGATGGTGAT CTGCCAAGGG CAGCTCGACA GCGTGGTGCC GATCGAGCCG GCGTCGATGC CGGGCCGGTC GGTCGTTCAA TGGGATAAAG ACGATTGCGC CGACATGCAC ATCATCAAAG TGGACCTACT TGGACTGGGG ATGATGGCGG TAATCAAGGA TTGCGTGGAC TTGGTACCGC GGCACTACGG TGTTCCCGTG GATTTGGCGC AACTACCCCA GAACGATGGG GATGTTTATC GCACGCTGCA GAAGGCGGAC ACAGTCGGCA TGTTCCAGGT GGAAAGCCGT GCGCAGATGG CGTCGCTACC GCGGAATTAT CCAACGCGGT TTTACGACAT CGTGGTGCAG GTTGCGATCA TCCGACCGGG GCCGATCGTG GGGAACATGG CAAACCCGTA TATGCGTCGC CGACAGAAGA AGGAAGAGGT GACGTACTAC CATCCGTTGC TGGAGCCGGT GTTGAAGCGG ACGCTGGGCG TGCCGTTATT CCAGGAGCAG TTGCTCCGGA TGGCGATGAT CGTCGCAAAC TTCTCCGGCA CCGAAGCGGA GGAATTGCGA CGCGCGCTGG GAATGCGGCG CTCCACGCAG CGTATGCGCG AGCTCGGCGT GAAATTGCGC GCGGGAATGA CGGAGAACGG GTTTGATCCG AAGACGCAGG AAGAGATCAT CCAGAGCATT TCGTCGTTCG CGCTGTACGG GTTTCCGGAG TCGCACGCCG CAAGTTTCGC GCTGATTGCG TATGCGTCGG CGTATTTCAA GGTGCATTAC CTCGCGGCGT TCACGGCGGC GATCCTGAAT AACCAGCCGA TGGGCTTTTA TTCGCCGGCG GTATTGGTGA AGGACGCGCA ACGGCATGGA TTGCGCGTTT TACCGGTGGA TGTACAACGT TCGGTGTGGG ACTGCACGGT AGAACACGCC GGTGACGAGC GTAAATTGCG GTTGGGATTG CGGTATGTGC GCGGATTACG GCAGGAAACC GCAGAAATAC TGGTGAAATC GCGGGAAAAC CACGGTCCGT TCGCGTCGTC GGAGGACTTG GGCCGGCGGG TGCCGCTGCT GAACCGCAAA GAACTGGTGG CGCTGGCGCA AATCGGAGCG CTGAACTGGG TGGGCGGTAC GGCGCACCGT CGCGACGCCT TGTGGCAGGT GGAGCGGGTG AGCCGACGCC CCGGACCGCT CCTGCAATCC GCGGAGGAAG AAGAGGACGT TTCACCGCTG GCGCAGATGA ATTGGGAGGA GCGGTTAGTC GCTGATTTCA ACGGCACCGG CCTGACCGTC GGCAAGCATC CGATGACCTA TCATCGCGAG CGGCTAGCAA AGATGAGGGT GCTTTCGGCG GAACAGTTAC AGGTTGCGGA AGACGGACGG CAGGTTCGAA TCGCCGGCTG CGTGATCGCA AGGCAGCGCC CGGGGACGGC GAAAGGTTTT GTTTTCCTGA GCATCGAGGA CGAGACGGGG ATTGCGAACG CGATCATTAG TCCGCAACTA TATGAGCAGA ACCGGGTGGT GGTTTTTACC GAGCGGTTCC TGGTCGTGGA AGGCAAGCTT CAAAACCAAG ACGGCGTGAT TCACGTCAGG GCGCAGCGAG TGCAGTCGTT GCATTTGAAC CAGGTTGCAG CACCTTCGCA CGATTTTCAT TGA
|
Protein sequence | MTAQNQYIEL HANSAFSFLR GASLPETLIF RAQQLEYPAM ALMDGNGVYG SARFHLAAKP NGVRAHVGSE IAVRDLGERM APAAWLPHQW PAEPVRLPLL CESREGYQNL CRMVTRLKMR EPSKAEGAAV MADVEEFARG LVCLTGGDEG PLASALARGG EPEGHKTVEH LVRMFGRENV FVELQRHGLR EQEWRNQAAV RIARSLQLPI LATNGVRYAD EPEREIADLF TAIRHHVALK DAGRLLAQNS CRYLRPTAEM ARLFRDLPEA VANTWELSQR LTFELDDLGY QFPIYPTPDG ESMDSFLEKR VQEGVIKRYG AKNEPDLYAR AQKQVARELA LIKKLGLAGY FLIVWDIIRF CQQNDILVQG RGSAANSAVC YALEITAVDP VGMELLFERF LSEARGEWPD IDLDLPSGDK REQAIQYVYT RYGHLGAAMT ANVITYRGKS AAREVGKALG FDVETLNKLT KLVSTWEWRG PNDTLENQFE SAGFEAKHPH IAKYLDLCSR IQDLPRHLGQ HSGGMVICQG QLDSVVPIEP ASMPGRSVVQ WDKDDCADMH IIKVDLLGLG MMAVIKDCVD LVPRHYGVPV DLAQLPQNDG DVYRTLQKAD TVGMFQVESR AQMASLPRNY PTRFYDIVVQ VAIIRPGPIV GNMANPYMRR RQKKEEVTYY HPLLEPVLKR TLGVPLFQEQ LLRMAMIVAN FSGTEAEELR RALGMRRSTQ RMRELGVKLR AGMTENGFDP KTQEEIIQSI SSFALYGFPE SHAASFALIA YASAYFKVHY LAAFTAAILN NQPMGFYSPA VLVKDAQRHG LRVLPVDVQR SVWDCTVEHA GDERKLRLGL RYVRGLRQET AEILVKSREN HGPFASSEDL GRRVPLLNRK ELVALAQIGA LNWVGGTAHR RDALWQVERV SRRPGPLLQS AEEEEDVSPL AQMNWEERLV ADFNGTGLTV GKHPMTYHRE RLAKMRVLSA EQLQVAEDGR QVRIAGCVIA RQRPGTAKGF VFLSIEDETG IANAIISPQL YEQNRVVVFT ERFLVVEGKL QNQDGVIHVR AQRVQSLHLN QVAAPSHDFH
|
| |