Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1095 |
Symbol | |
ID | 8806855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 1159343 |
End bp | 1162477 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_003460343 |
Protein GI | 289208277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCCA CTACCCCCGA ACCGCCGGAG TACGCCGAGC TTATGGCGGT CAGCAACTTC TCGTTCCTGC GCGGGGCTTC GCACCCGGAG GAGCTGGTGG AGCGGGCCCG GGCGCTGGGC TATCGCGCGC TGGCGCTGAC CGATATCGCC TCGCTGGCCG GCGTGGTGCG CGCGCACGCG GCGGCGAAGG AATGTGGCTT GCACCTGATC CTGGGGGCGA CCTTCCAGCT CGCGGACGGC CCCGCTCTGA CCCTGCTGGT ACGCAATGCG GCGGGCTACG CCGCGCTGTG CCGCCTGATC ACTCAGGCGC GCGGGGCAGC GGACAAGGGG CATTACCACC TGACCCGGGC GGACCTCGAC GTCCTGCCCG CCGAGGCGCG GGCTGGCTGG CACGTCCTGC TGCGTATCGA ACCCGAGCCC GCAGAACCGG CGCTCGCCGA AGCCGTCCGC TGGCTGGCCG GATGGGCCAC GCCGGGTCAG GCCCATCTGG CGGTCAGCCA GTTGCGTACG CCGGAGGATG CCACGCTCGC AACCCGCGCA CGACAGCTCG CCGAGGCGAC GAGACTCCCG ATCGTCGCCT GCGGCGACGT GCACATGCAC AGCCGGGGCC GGCGTGCGTT GCAGGACGTG CTCACCGCTC TGCGCCAGCG CACCACGGTC GCCGAGGCGG CCCCGGCGCG CTTCCCCAAC GGCGAGCGCT GCCTGCGCCC GCGCGACGAC CTGGCCACGT TGTATCCCGC CCCCTGGCTC GCGGCCAGTG TGCGGATCGC GCGGGACTGC GACTTTTCGC TGGATGCGGT CACCTACCGC TACCCGACCG ATCCCCTGCC CCGGGACACG AGCCCGGCCG CCTACCTGCG CGAGGAAACC CTCAAGGGGG CACGGGGGCG CTGGCCCGAC GGCATCCCGC CGAAGGTGAC GGAGCAGATC GAGCGCGAGC TCGGGATCAT CGCGCAGCTT GGCTACGAGC CCTACTTTCT GACGGTCTAC GACATCGTGC GCTTTGCCCG CGGCCGCGGG ATCCTGTGCC AGGGGCGCGG CTCGGCGGCG AATTCTGCCG TGTGCTATGC CCTCGGCATT ACGGCGGTGG ACCCCTCGCG CTCCAGCATG CTGTTCGAGC GCTTCATCTC CGCCGAGCGC GGCGAGCCAC CGGATATCGA CGTCGACTTC GAGCACGAGC GCCGCGAGGA GGTCATCCAG TACATCTACC AGCGCTACGG CCGCGAGCGC GCGGCGCTGG CCTGTACCGT GATCCGCTAC CGCCGCCGCA GCGCCCTGCG CGACGTCGGC CGCGCCCTGG GCATCTCGCG CGCACGACTG GATGCGCTGA CCGCCAACCT CGCCTGGTGG GACGACGGCA TCCCCGCCGA GCGGCTGCGC GAGGTCGGGC TGGACCCGGA TGGCGCCCTG GCCCGGCAGT GGCAGGCCCT GACCGCCGAG CTGCTGGGCT TTCCCCGCCA CCTGTCGCAG CACACCGGCG GCTTCGTGAT CGCCGAGGGC CGGCTGGACG AGCTGGTGCC GATCGAGAAC GCGGCAATGC CGGAGCGCTC GGTGATCCAG TGGGACAAGG ACGACCTGGA TGCCGTGGGC CTGTTGAAGA TCGACGTGCT GGCACTGGGC ATGCTCTCCT GCGTGCGCCG CGCGCTGGAG CTAATGGCCC AGCACCGGGG CCGCGACTGG GGCCTGGCCG ACGTCCCCGC CGAGGACCCG GCGGTCTATC GCATGCTGCA GAAGGCCGAC ACCCTGGGCA TATTCCAGAT CGAGTCGCGC GCGCAGATGA GCATGCTGCC CCGCCTGAAA CCGGCAAACT TCTACGACCT GGTGATCGAG ATCGCCATCG TGCGCCCCGG CCCGATCCAG GGCGAGATGG TGCACCCGTA CCTGAAGCGC CGCGCCCACC CGGAGCGGGT AAGCTACCCG AACGCAGAGG TGAAGCGCGT ACTCGGGCGC ACCCTGGGCG TGCCGATCTT CCAGGAACAG GTGATCGAGC TGGCGATGGT CGCGGCCGGC TTCTCCGCTG GCGAGGCGGA CGGGCTGCGC CGCGCGATCG GGGCCTGGCG CCGCACCGGC CAGCTCGCGC GCTACCGCGA ACGCTTGATG GACGGGATGC GGCAACGCGG CTATCCGGAG AAATTCGCCG AGCAGCTGTA CAAGCAGATC CTCGGCTTTG GCGAGTACGG CTTTCCCGAG TCCCACTCGG CCAGCTTTGC GCTGATCACC TGGGTCTCGT CCTGGCTCAA GTGCCATGAA CCGGCGATCT TCGCCTGCGC CCTGCTGAAC AGCCAGCCGA TGGGCTTCTA CGCCCCGGCG CAGATCGTCG CCGATGCGCG CCGCCACGGC GTGGAGGTGC GCCCGGTGGA TGTGCGCTGC AGTGACTGGG ACTGCACGGT GGAGCCCGTA GCCGGAAGCG ACGCGCTGGC CCTGCGCCTG GGCCTGAACC GGGTCGGCGG GCTCGCGCAG GCCGTGGCCG AGCGTGTCGT CGCCGCGCGC GCCGAACGCC CGTTCGCGGA CGTACCGGAT CTGGTCGCCC GCGCTGCACT CACCCGCGCC CAGGCCAACC GCCTGGCCCG CGCCGACGCC CTGAAGGGGC TGGCCGGCAA CCGCCACCAG GCGCGCTGGG CGACCGAGGC GGTGGAGGAA CCCCTGCCCC TGTTCGCCGA CGCCCCGGCC GAGACGGTGC GCGAGGCCGA CGTAACCCCG CTGCTGCCGG GCCCGGATGC GGTGGACGAC CTGGTGCAGG ACTATGCCAC CCAGGGCCTG AGCCTGGGCC CGCACCCGCT GGCGCTGATG CGCCCGTTCC TGGAGCAAAG ACGTTTACCC ACCGCCGCCC GGCTACTGGA ACGCCCCGGG CACCCCGGCG CCCGCTATGC CGGCCTCGTG ATCACCCGCC AGCGGCCCGG CTCGGCCCAT GGCACGGTAT TCCTGACACT GGAGGACGAA AGCGGCACGC TGAACCTGAT CGTCTGGCCG GACCGGGTGG AGCGCTTTCG CGCCGAGGTC CTGCACGGCC GCCTGCTGGA GGCCCACGGC GAGTGGCAGT ACAGCGAGGG GGTCGGCCAC TTGATCGTGC AGCGCCTGAT CGACCGCAGC GACTGGCTGG GCGAGCTGAC CACCCGCTCC CGCGACTTCG GCTAG
|
Protein sequence | MDPTTPEPPE YAELMAVSNF SFLRGASHPE ELVERARALG YRALALTDIA SLAGVVRAHA AAKECGLHLI LGATFQLADG PALTLLVRNA AGYAALCRLI TQARGAADKG HYHLTRADLD VLPAEARAGW HVLLRIEPEP AEPALAEAVR WLAGWATPGQ AHLAVSQLRT PEDATLATRA RQLAEATRLP IVACGDVHMH SRGRRALQDV LTALRQRTTV AEAAPARFPN GERCLRPRDD LATLYPAPWL AASVRIARDC DFSLDAVTYR YPTDPLPRDT SPAAYLREET LKGARGRWPD GIPPKVTEQI ERELGIIAQL GYEPYFLTVY DIVRFARGRG ILCQGRGSAA NSAVCYALGI TAVDPSRSSM LFERFISAER GEPPDIDVDF EHERREEVIQ YIYQRYGRER AALACTVIRY RRRSALRDVG RALGISRARL DALTANLAWW DDGIPAERLR EVGLDPDGAL ARQWQALTAE LLGFPRHLSQ HTGGFVIAEG RLDELVPIEN AAMPERSVIQ WDKDDLDAVG LLKIDVLALG MLSCVRRALE LMAQHRGRDW GLADVPAEDP AVYRMLQKAD TLGIFQIESR AQMSMLPRLK PANFYDLVIE IAIVRPGPIQ GEMVHPYLKR RAHPERVSYP NAEVKRVLGR TLGVPIFQEQ VIELAMVAAG FSAGEADGLR RAIGAWRRTG QLARYRERLM DGMRQRGYPE KFAEQLYKQI LGFGEYGFPE SHSASFALIT WVSSWLKCHE PAIFACALLN SQPMGFYAPA QIVADARRHG VEVRPVDVRC SDWDCTVEPV AGSDALALRL GLNRVGGLAQ AVAERVVAAR AERPFADVPD LVARAALTRA QANRLARADA LKGLAGNRHQ ARWATEAVEE PLPLFADAPA ETVREADVTP LLPGPDAVDD LVQDYATQGL SLGPHPLALM RPFLEQRRLP TAARLLERPG HPGARYAGLV ITRQRPGSAH GTVFLTLEDE SGTLNLIVWP DRVERFRAEV LHGRLLEAHG EWQYSEGVGH LIVQRLIDRS DWLGELTTRS RDFG
|
| |