Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0016 |
Symbol | |
ID | 7085114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 21825 |
End bp | 23813 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697066 |
Product | hypothetical protein |
Protein accession | YP_002353715 |
Protein GI | 217968481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGAT TCATGCAGCG TGAGGAACGC ATCAGAACAG CGGAAGGGGA GTTCATCGTT CGCTCGATCG ACCCCAACGA CAAGCTCGTC GAGCTTTTCG ATCCGAGCAG CGGGGAGCTC AAGAAGATGA CCATTCCCGC CATGCGGCGC CAGATCTCGG ACGGCTCGAT GCGGCGACTG ACCGTGAAAC CGATGTCGGG CACCGTGCGG GATCTCGCAC AAAGCGACTC CGCGAGCCGA CAGCTGCTGT TCAACCGGTT CCGCGTGCAA CGCCTCGAAA GCTGCCTGCA TCGTGGCGAC AGCAAGGCAG AGGCGATTCG AAAGCTGATC GCTGAGCCAC TCGCGCTCGA CGACGGCACC CCCGTACCCC CGATTTCAGA GCGCCAGGCG TACCGGCTCA TTGATGCCGC ATCCAGCTCC CCGCTCGAGC TGATGCCCGC TCACGCCGAA CGCGGAAACC GCCTTCCGCG TCACTCGAAG GACGTTGAGG AACTGGTGCG CCACCTCATC GAAGAGGAGT ACGCGAAGGT TCACTCGCGC ATCACGATGC GGAAACTGTC CGAGCTCGCA ACGACGCTCG CTCGCGAGAA AGGGCTGATC GCCACGAGCC GCCGCATCAG TCGGGCCTAT GTGCGGGACT TCTTCATCAA ACGCTACCAC GGCGATATCG ACCACAAGCG CATCGATCCG CGTATCGCGA GGTCAAAAAA GGCCGTCGCC AAAGAGCGGA TCCGCGTCGA CGCCGCACTG CAGCGCGTCG AGCAGGACAC GAACTCTTTG CCCTTCATCG TCATGACGTC AGAAGGCCCG CTCAAGAATC CGTATCTGAC GGTGTCCATC GACTGCGGCA CCGGCGTCCC GCTGGGGTGG CACCTGTCCA AGACCCCCGT CACGGAAGAA GAGACGCTCG ACTGCCTCGA GAGGAGCCTC TACTCGAAAG CCGAGCGCTT CAGCCAGCTC AGCATCGACT GCAGCATCGA TCCGTACGGC CTGTTCGCAA ACTTGTACCT GGATAACGGC CCCGAAAACA AGGGCCGCCG CATCACCCGC ATCACCGAAA TCGGCATCTT CCTCACGCGC GTCGCGGCAA ACTCGGGGCA TCAGAAGCCT TACATCGAAC GATTTTTCGG CAGCCTGAAA CCCGCGCTCG AGGCTCTGCC CGGCTGCACG CGGTTCGACG GAAAGGACGG TGCGCGAACC GACGAAGCCA TGAAGGACGA TCTGTTGACC CTCGACGAGC TCGAGCGGTG GATCGTACGT TGGATGTATG AAGAGTGGGT TCACAGACCG CTTGAGCGCT TCATCACGGC CGATTATTAC GACACGGACG AAGCCCCTGG CATCACCCCG GCGACACGCT GGGCTTACTA CGAGCAGAAC ACGAGTCTGC CGCCCCCGCC CGACCGCGAG CGTTGGATCA GGATGCGCTA CCTCACCGAT GAGCGCAGCT TGAGCGCGAA GACGGGGCTG TCGATCGAAG GCTTCCGCTA CAGGGGTGAG CACCTGCGCC AGCTCATCCG CCAATATGGG CCAGACTCGC AAGTCACGGC CTATTACAAC CCGAGCGACT ACCGCTTCGC TTATGTTGCC GACCGGGAGA CCGGCGAGCT GCTGCGTCTC GTCAATGACG AGGTCAATGC CACGACGCCT GCATTCTCGT TCACCGAGGC CAAATTGCGG CGCGCGCACG TCCGCAAGAC CGCTGCGGCG GTCCCTGCTT GCGTCGCGAA CTTCCAGCGT GAATTGGCCG CAGCGTCGCT CTCGGGTGGC CGCCGCAAAC GCGGGCACAT GGCAGAGCAG CGCGAGGTCC GTGCCGCCGC AAAGCTCAGC AAGGCAATTC AGAAAAGCGA GGCCAACCCC GTCCCGGCGC AGCCCGCCGG AGACGCTCCG ACGCTGTGGA CCGACGGGCT GTTGCTGACC GACGACGCCA TCCCCAATTA CGATCTCGAG ACGAAGCCGC GCCAGTCCAA CGGGGGCACC GAGTCATGA
|
Protein sequence | MSRFMQREER IRTAEGEFIV RSIDPNDKLV ELFDPSSGEL KKMTIPAMRR QISDGSMRRL TVKPMSGTVR DLAQSDSASR QLLFNRFRVQ RLESCLHRGD SKAEAIRKLI AEPLALDDGT PVPPISERQA YRLIDAASSS PLELMPAHAE RGNRLPRHSK DVEELVRHLI EEEYAKVHSR ITMRKLSELA TTLAREKGLI ATSRRISRAY VRDFFIKRYH GDIDHKRIDP RIARSKKAVA KERIRVDAAL QRVEQDTNSL PFIVMTSEGP LKNPYLTVSI DCGTGVPLGW HLSKTPVTEE ETLDCLERSL YSKAERFSQL SIDCSIDPYG LFANLYLDNG PENKGRRITR ITEIGIFLTR VAANSGHQKP YIERFFGSLK PALEALPGCT RFDGKDGART DEAMKDDLLT LDELERWIVR WMYEEWVHRP LERFITADYY DTDEAPGITP ATRWAYYEQN TSLPPPPDRE RWIRMRYLTD ERSLSAKTGL SIEGFRYRGE HLRQLIRQYG PDSQVTAYYN PSDYRFAYVA DRETGELLRL VNDEVNATTP AFSFTEAKLR RAHVRKTAAA VPACVANFQR ELAAASLSGG RRKRGHMAEQ REVRAAAKLS KAIQKSEANP VPAQPAGDAP TLWTDGLLLT DDAIPNYDLE TKPRQSNGGT ES
|
| |