Gene Information |
Locus tag | Tmz1t_0936 |
Symbol | |
ID | 7085039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1026770 |
End bp | 1028713 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643697958 |
Product | KAP P-loop domain protein |
Protein accession | YP_002354598 |
Protein GI | 217969364 |
COG category | [R] General function prediction only |
COG ID | [COG4928] Predicted P-loop ATPase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTGA CCGACAACGA AACCCGGGTC GACCTGCTGA ACAACGAGGC CATCGCGACC ACGATCATCG AGTTGCTTCG CGCCAGGCCC GACCATCCGG TGACGATCGG CGTGCATGGC GACTGGGGTG CGGGCAAGTC CAGCGTGCTC GAAATGATCG AGGCGGGGTT CGCCGACAAG GACGAAGTCC TGTGCCTCAA GTTCAACGGA TGGCGCTTTC AAGGCTTCGA GGATGCCAAG ATCGCCCTGA TCGAGGGGAT AGTCACGGGC TTGGTCGAGA AGCGTCCGGC CCTGAACAAG GCCGCGGCCG CGGTCAAGGA TGTCTTCCGG CGCATCGACT GGCTAAAGGT GGCGAAGAGA GCCGGCGGCC TCGCCCTGAC CGCGTTCACA GGCATTCCGA CACCCGATCA AATCGGAGCT ATCGTCGGCT CGCTCGAAGC GGTGATGGCG GATCCTGCCA AGCTCGCCAC GAAAGAAAAC CTCTCGACGG CGATCGACGA GGTGAAGGCC GTGCTGAAGC CGGGCGAGAC AAAGAATGTG CCCGAGGAAG TGGAAGCGTT CCGCATGGCC TTCGACCAGC TCCTGAAGGA CGCGGGGATC AAACAGCTCA TCGTCCTGAT CGATGACCTC GACCGCTGCT TGCCAGACAC CGCCATCGAA ACGCTCGAAG CGATCCGGTT GTTCGTGTTC ACCGCCCGGA CGGCGTTCGT CGTCGCGGCC GACGAGGCGA TGATCGAGTA TGCGGTGCGC AAGCACTTTC CCGATCTGCC GGACAGCACC GGACCGCGGG ACTACGCACG CAACTACCTC GAAAAGCTCA TCCAGGTCCC ATTCCGTATC CCGGCGCTGG GCGAGACCGA GACACGGATT TACGTCACGC TGCTGCTGGC CGGCGCCGAA ATCGGTGAAA ACGACGCCGA CTATGCGAAC TTGATTGGCG TGGCGCGTGA GAAGCTGAAG CGTCCTTGGA CCAGCGGCGG TTTGGATGCG GCGACCGTCA AGACGGCGCT CAGCAAGCAA GCCGAGAAGG CGAACAACGC ACTTGCCCTC AGTGACCAGA TTGGCCCCAT CCTGGCGAGC GGTACGAAGG GCAACCCGCG TCAGATCAAA CGCTTCCTCA ACACGCTGCT GCTGCGTGAG CGTACTGCGG CCGCGCGCGG ATTCGGGGAC GATATCAAAC TGCCCGTGCT CGCGAAGCTC ATGCTCGCTG AGCGCTTCAT CCCGAGGCTG TTCGAGCAGA TCGCGTTTGT CGCCGCGGTC CACCCGCAGG GGCTATGCGA GGACCTCGAA ACACTTGAAA AGGGGCTGGC GACGGCCGAC GGTAAGGAAT CGCAGGGCGG CGAGCGGAAG GGACAGAGAG CCGGCGAGCC AGGGCCCGCG CTTGACAACG CCGTGTTGAC TGAATGGCGG TCGTCAGAGA CGGTGTGCGA CTGGGCGCGC CTGTCACCCA AACTCTCGGG TCTCGATCTA CGCCCCTACT TGTTCGTGAC AAAGGACAAG AAGGACTACT TCGGCCCGGT GTCCGTGCTG GGTCACTTGG CCGGCGTGGT GGAGAAGCTG TTCAGCGGCA AGATGACCGT CCAGGGCTAT GAGGCCGAGT TGAAGCAGCT CGTGCAGCCC GAAGCGGAGA AGGTGTTCGA GGCCGTGCGC ACCAAGATCA TGAGCACGGG CACCTTCGAC ACCAAGCCCG CGGGTGTTGA TGGACTTGTC GTCCTCGTGA AGGCGCAACC TGGCTTGCAG GGCCGGTTGA TGGACTTCTT GGAAGCATTG CCAAGTGGCA AGTGTGGCCC ATGGGCTGTC 
AGCGGCTGGC AGGGTGTCAT CAAAGATGCC GAATGCGCCG CTCGTCTGAC AAAGCTGCTG GGCGAGTGGA GCAAGGTCGC CAATAACCCC GGCCTCAAAG CAAGCGCCGA AGCGGCCCTC AAGGACGTAA AGGGAGGACG CTGA
|
Protein sequence | MILTDNETRV DLLNNEAIAT TIIELLRARP DHPVTIGVHG DWGAGKSSVL EMIEAGFADK DEVLCLKFNG WRFQGFEDAK IALIEGIVTG LVEKRPALNK AAAAVKDVFR RIDWLKVAKR AGGLALTAFT GIPTPDQIGA IVGSLEAVMA DPAKLATKEN LSTAIDEVKA VLKPGETKNV PEEVEAFRMA FDQLLKDAGI KQLIVLIDDL DRCLPDTAIE TLEAIRLFVF TARTAFVVAA DEAMIEYAVR KHFPDLPDST GPRDYARNYL EKLIQVPFRI PALGETETRI YVTLLLAGAE IGENDADYAN LIGVAREKLK RPWTSGGLDA ATVKTALSKQ AEKANNALAL SDQIGPILAS GTKGNPRQIK RFLNTLLLRE RTAAARGFGD DIKLPVLAKL MLAERFIPRL FEQIAFVAAV HPQGLCEDLE TLEKGLATAD GKESQGGERK GQRAGEPGPA LDNAVLTEWR SSETVCDWAR LSPKLSGLDL RPYLFVTKDK KDYFGPVSVL GHLAGVVEKL FSGKMTVQGY EAELKQLVQP EAEKVFEAVR TKIMSTGTFD TKPAGVDGLV VLVKAQPGLQ GRLMDFLEAL PSGKCGPWAV SGWQGVIKDA ECAARLTKLL GEWSKVANNP GLKASAEAAL KDVKGGR
|
| |
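The numeric fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names and the partial codon table are hypothetical helpers introduced for illustration; the table covers only the codons in the first 30 bp of the gene sequence, all of which have the same meaning in translation table 11 as in the standard code.

```python
# Values copied from the record above.
START_BP = 1026770
END_BP = 1028713
GENE_LENGTH_BP = 1944
PROTEIN_LENGTH_AA = 647

# First three 10-bp blocks of the gene sequence, as printed in the record.
FIRST_30_BP = "ATGATTCTGA CCGACAACGA AACCCGGGTC"

# Partial codon table (illustrative): only the codons that occur in FIRST_30_BP.
CODONS = {
    "ATG": "M", "ATT": "I", "CTG": "L", "ACC": "T", "GAC": "D",
    "AAC": "N", "GAA": "E", "CGG": "R", "GTC": "V",
}


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate_prefix(dna: str) -> str:
    """Translate a short DNA string using the partial table above."""
    dna = dna.replace(" ", "")
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate_prefix(FIRST_30_BP))
```

Under these assumptions the coordinate span, the stated gene length, and the stated protein length agree, and the translated prefix matches the first ten residues of the protein sequence in the record.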