Gene Information |
Locus tag | HY04AAS1_1612 |
Symbol | |
ID | 6744444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1529552 |
End bp | 1533157 |
Gene Length | 3606 bp |
Protein Length | 1201 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642751432 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_002122271 |
Protein GI | 195953981 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
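
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both are common conventions, but the record itself does not state them. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length_bp(1529552, 1533157)
print(length, protein_length_aa(length))  # prints "3606 1201"
```

Under those assumptions the result agrees with the Gene Length (3606 bp) and Protein Length (1201 aa) rows.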
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00506055 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATT TTGTGCATCT TCATCTTCAC ACCCAATATT CGCTTTTGGA CGGAGCTATA AAGATAAAGG ATTTGGCTAA AAAGGCAAAG GAATACGGCT ATAAGGCTGT GGCTATAACG GATCATGGTA ACCTTTTTGG TACTATGAGC TTTTACAAAG AGATGAAGGC AAACGGTATA AAGCCCATTA TAGGTATGGA AGCTTACTTT ACCACAGGAA AAAGAACTGA GCATAAAGGA AAAGGTTCTG AAGATAACAT AACAGATAGA ATAAACCACC ATATCATACT TTTGGCAAAA AACGACACTG GGCTTAAAAA CCTCATGAAG CTTTCTTCTA TAGCTTTTAT TGAAGGCTTT TACTATAAAC CAAGGATAGA TTATGAAGTG TTAGAGCAGC ATGCGGAAGG TCTAATCGCT TTGACGGCCT GTCTTAAGGG AGTGCCTACT TTTTACGCAG CTCAAGGCAA CGAAGAGATG GCTTATAATT GGGTTAAAAA GTTTAAAGAT ATCTTTGGGG ATGATCTTTA TTTAGAGCTT CAATCAAATC ATATACCAGC TCAGGAAACC GCCAATAAAA CCCTTATAGA TATAGCCAAA AGATACAACG TAAAGTTTGC GGCCACCAAC GATTGTCATT ACCTTTTGGA AGAAGATTTG AGGGCTCACA ACGTGTTGAT GGCAATTCAG ATGAAAAAAA CACTTCAGGA GCTTGGAGAA GATGCTTTTG GGCACTATGA TGGTATGCAC TTTGCCTCAT ACGAGGAGAT GGTAAAGAAA TTTGAAGGAA AGTGGAGTGA GTGGGAAAAA GCGCTTTTAA ACACTGTTGA AATATCAGAA AAAGTTGCAG ATTCTTTAAG CATATTTGAA GACAAATCTT ACAAGTTTCC TGAGTTTTTT AAAGGTGATG ATATAAATGT CAATATATCT TCTTACCTTA GAAATTTAGC CGTTGAGGGT CTTAAAAGCC GTATACAAAA ATCTCAAATA AGCAAAAATA TACCAGAAAA AGAGTACTGG GACAGACTAA ATTATGAATT AGAAGTAATA CAAAATATGG GTTTTGATAG TTATTTCTTG ATAGTATCGG ATTTTATAAA CTGGTCAAAA TCAAACAACA TACCCGTGGG TCCTGGAAGA GGTTCTGCGG CTGGGTCTTT GGTGGCTTTT GCTCTAAACA TTACAGACGT TGATCCGTTA AAGCACGGAC TTATCTTCGA AAGATTTTTA AATCCAGAGC GTATATCTAT GCCAGATATA GATGTAGACT TTTGCATGGA CAACCGGGAT AAGGTTATAG AGTATGTAAA AGAAAAATAC GGCAAAGATT CTGTAGCTCA GATCATTACT TACAATACGA TGAAAGCTAA ACAAACCCTT AGAGATGTGG CAAGGGCCAT GGGTATGGCT TACAAAGACG CCGATGTACT AGCAAAACTT ATACCTCAAG GCAACGTCCA AGGTACTTGG CTAAGCCTTG AAGAGATGTA TATAACGCCC ATAGAAGAGT TGATGGAGCG TTATGGACAC AGAGGAGATA TTGAAGATAA TGTAAAAAAA TTTAGAGACC TTGCAAAAAA AGACCCTCAG ATAAAAGAGC TTGTGGAAAT ATCTATAAAA CTTGAGGGTC TTACAAGGCA TACATCTTTG CATGCTGCTG GTATTGTGAT AGCTCCAAAA CCCCTTATAG AGTTGGCACC TTTGTATCTT GATAAATCGG TAAAAGATGA TACTGGCAAC ATAGCTACCC AATACGATAT GGCTCATTTA GAAGAGCTTG GGCTTGTCAA AATGGACTTT 
CTCGGATTAA AAACGCTTAC AGAGCTATGG AAGATGAAAG AGCTTGTAAA ACAAAACAAA GGTGTAGATA TTGATTTCTT GTCTTTGGAT TTTAACAACA AAGAAATTTA CGAATTTTTA GCTACCGGTG AAACGATAGG GGTGTTTCAA CTGGAATCAA AGGGCATGAG GGAGCTTATA AAACGTTTAA AACCAGATAG ATTTGAAGAT ATTGTAGCCG CTTTAGCCCT TTATAGACCA GGTCCTATAA AAAGCGGGAT GGTAGATAAA TTTATAAATA GAAAACTTGG TAAAGAAAAA GTTGTATACG AATTTGAAGA GCTTGAAGAG GTGCTAAAAG AAACTTATGG TCTCATTGTT TATCAAGAGC AGATCATGTT TATATCAAAC ATTCTTGCTG GGTTTACCAT GGGAGAGGCA GATAATTTGA GAAAGGCTAT AGGTAAGAAA AAGGCTGATC TCATGGCAAA GATAAAAGAT GATTTCATAA GAAGAAGCTG TGAAAGAGGC TATCCTAAGG ATAAAATAGA AAAACTTTGG TCTGAAATAG AGGAGTTTGC CTCTTATTCT TTCAACAAAT CTCACTCTGT GGCTTACGGT TACATATCTT TTTGGACTGC TTACGTGAAG CTTTACTATC CAGATGAGTT TTATGCTGTG AAATTTTCTA CAGAGAACTC CGATAAGAAG TTTATAAACC TTTTAAAAGA TGCGAAGTCT TTTGGTATAA AAGTATTACC TCCAGATGTA AACAAATCCG ATGTAGATTT TAAGATAGAA GTTCCAAGAC ACATAAGGTT TGGGCTTGGA AGAATAAAGG GTGTTGGTGA GGATACCGCA AGACATATAA AATCTATGCA AGAAAAAATT GGAAGAGACT TTCTAAGTTT TCAAGATTTT ACCAAAAACA CAGACAATAG GAAAGTGAAT AAAAAAGTTT TAGAAGCTCT TTCAAAAGCA GGGGCTTTTG ATAGTCTTAT AGAAAAAAGC GGTTATAAAA ATAGAGAAGA TTTTCTCTCA AGGCTTTTGA ATTCTCAGGA TATATCAAGC ATAGCCCAGA GGTCGTTGTT TGGTATAAAA GCCTCGAAAT CCACTGAGCA GATGGAACAA AGCTCTTCTA TAGATATATT AAAATTAGAA AAGGAGGTTC TTGGTTTTTA TATATCTGGG CATCCACTGG ACAAATACAG CTGGATTTTA TCTCACAACA AAGACATAAC AAATATAGAA GATATAGATT TTGACAATAT GCCAAACCAA GCCATAGACA GCGAGGCTGG GCAAAACGCA TCGGCGGAAT ATACGATTGT GGGTGTTATA AGCGATTTAC AGATTAAAAA AACCAAAAAT GGTTCTTACA TGGCTATTTT TAACCTTGTG GATAAAACGG ATATTGTTGA AGTTGTGGTT TTCCCGGATA GGTATGCATC TTCTGAAGGT ATTATAAAAG AAGATGAAGT GGTGGTGGTA AAGGGTGTTT TAGACATAGA CATTGAAAAT GAAAACCTTA AGATAGTGGC AAATGATCTT TATTCTATAG ATTCTTTGCT AAATCAATAC AACACGCTAA CTCTTAAGAT AGATGAGGAA AAAGCGAAAA ACGGCTTTTT AGAAAAGCTT AAATCTATGC TGGACAAATA TACGCCTAAG ACTAACGAAG ATGTTATGAA AACTCAAAAA GTTGTATTGG AACTAGACGT TAGCTCCTAT AGGGCTATAG TACAAACCTC CAAGGATGTG GTGCTTTCAA AAACCCTCAT AGAAGAACTC AGTAAATACG GTATAGAATT TTCTTTGGCA TCTTAA
|
Protein sequence | MKDFVHLHLH TQYSLLDGAI KIKDLAKKAK EYGYKAVAIT DHGNLFGTMS FYKEMKANGI KPIIGMEAYF TTGKRTEHKG KGSEDNITDR INHHIILLAK NDTGLKNLMK LSSIAFIEGF YYKPRIDYEV LEQHAEGLIA LTACLKGVPT FYAAQGNEEM AYNWVKKFKD IFGDDLYLEL QSNHIPAQET ANKTLIDIAK RYNVKFAATN DCHYLLEEDL RAHNVLMAIQ MKKTLQELGE DAFGHYDGMH FASYEEMVKK FEGKWSEWEK ALLNTVEISE KVADSLSIFE DKSYKFPEFF KGDDINVNIS SYLRNLAVEG LKSRIQKSQI SKNIPEKEYW DRLNYELEVI QNMGFDSYFL IVSDFINWSK SNNIPVGPGR GSAAGSLVAF ALNITDVDPL KHGLIFERFL NPERISMPDI DVDFCMDNRD KVIEYVKEKY GKDSVAQIIT YNTMKAKQTL RDVARAMGMA YKDADVLAKL IPQGNVQGTW LSLEEMYITP IEELMERYGH RGDIEDNVKK FRDLAKKDPQ IKELVEISIK LEGLTRHTSL HAAGIVIAPK PLIELAPLYL DKSVKDDTGN IATQYDMAHL EELGLVKMDF LGLKTLTELW KMKELVKQNK GVDIDFLSLD FNNKEIYEFL ATGETIGVFQ LESKGMRELI KRLKPDRFED IVAALALYRP GPIKSGMVDK FINRKLGKEK VVYEFEELEE VLKETYGLIV YQEQIMFISN ILAGFTMGEA DNLRKAIGKK KADLMAKIKD DFIRRSCERG YPKDKIEKLW SEIEEFASYS FNKSHSVAYG YISFWTAYVK LYYPDEFYAV KFSTENSDKK FINLLKDAKS FGIKVLPPDV NKSDVDFKIE VPRHIRFGLG RIKGVGEDTA RHIKSMQEKI GRDFLSFQDF TKNTDNRKVN KKVLEALSKA GAFDSLIEKS GYKNREDFLS RLLNSQDISS IAQRSLFGIK ASKSTEQMEQ SSSIDILKLE KEVLGFYISG HPLDKYSWIL SHNKDITNIE DIDFDNMPNQ AIDSEAGQNA SAEYTIVGVI SDLQIKKTKN GSYMAIFNLV DKTDIVEVVV FPDRYASSEG IIKEDEVVVV KGVLDIDIEN ENLKIVANDL YSIDSLLNQY NTLTLKIDEE KAKNGFLEKL KSMLDKYTPK TNEDVMKTQK VVLELDVSSY RAIVQTSKDV VLSKTLIEEL SKYGIEFSLA S
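
The sequence fields above are printed in space-separated blocks of ten characters and may wrap across lines, so they need cleaning before analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content row. The helper names `normalize` and `gc_fraction` are illustrative assumptions, and the sample string is just the first two blocks of the gene sequence.

```python
def normalize(seq_field: str) -> str:
    """Drop all whitespace (spaces and line breaks) and upper-case the sequence."""
    return "".join(seq_field.split()).upper()


def gc_fraction(seq_field: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence field."""
    seq = normalize(seq_field)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = "ATGAAAGATT TTGTGCATCT"
print(normalize(sample))
print(len(normalize(sample)), gc_fraction(sample))
```

Passing the full Gene sequence field instead of the sample gives a length and GC fraction that can be compared against the Gene Length and GC content rows.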