Gene Information |
Locus tag | Tmz1t_3160 |
Symbol | |
ID | 7874301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3426348 |
End bp | 3427538 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643700089 |
Product | hypothetical protein |
Protein accession | YP_002890133 |
Protein GI | 237653819 |
COG category | |
COG ID | |
TIGRFAM ID | |
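
The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 3426348
END_BP = 3427538


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1191 396
```

The strand field does not affect this arithmetic; it only determines which strand of the replicon carries the coding sequence.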
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.44575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGTA CGGCTAAGCC TGTTGACCGG GAAACGCTTT ACAACGAGGT CTGGACTGAG CCGGTGTCCG TTGTCGCCCC TCGATACGGG CTGTCTGACG TCGGCCTCGC CAAGATATGC CGGTCTTTGG CCATCCCCCT GCCAAGTCGG GGGTATTGGG CCAAAGTCAA AGCGGGTAAG GTCATGCACC GGGTCCCCCT GCCCCCCCTG AAGCACTCGG GCGCAGTTCC CACGGGATTG GTCAAGTTGC CGTCTGAAAA GGTAGCAGTC CGGGAAGCTG CAAGAAAGAC GGCGGCTCGG GTGCGGAAAG AAGTCCCACC ACTTCCTCCT CCCGAAGAAG TGGGCACCCC TCACTCGCTC GTGGTGGCTA CATCGAAGCG GCTGCGCAGG CGTGAAGGCT GGCCTGAGGG CACTATGCTA CGCTCGGCCC CGAAGGAAGT GCTCAACCTT TCCGTGACCA AGGACGCCCT CGACCGAGCA CTTGCACTGA CGGATGCGCT CATCAAGGCG CTCGAGAAAG AGGGCTTCTC CTTCGAAATC GATGCCGAAA AAGGGGCCAC CTGGGTCAAG TGGTTGGAGA CCGGTACGAA GATGGCCTTC GCCATCAGCG AGCACGTCAA GCGCAGTGTG CATGTGGTTA CGCCTGCGGA GGAGCGTGCC CGGAAGCGGT ACTGGGATCG GTCGCGCTGG GACCACGCTG CAAGTTATCC AAGCATCCCA CAACATGACT ACACTCCGAC AGGCACCCTG ACGATAGAGG TAGGGCGCTG GCCATCCCGC AAATGGAACG ACACACCGAG AACGCAGCTA GAAAGTCGCC TCGGGGAGGT CGTCGGTGGC GTCATCGTCC TGGCGAGGGA CATCCATGCC AAAGAGCAAG AGGAGGCACG CCGCAAGGAG GCCTACCGCC TCGCAGTGGA GCGCTATGAG TTCCTCACGA CCCGCCATGC CGATGAAGTC GCTCGCTTCG AGGCGCTTGA GGCCGATGCA GCGAACTGGG AGCGAGCAGC AAAGCTTCGT GCCTTTGCTG ATGCGAAGGA AAGGCAACTT CGTGCAGTGG GTGGGCCTTC GGCTGAGCAG GCCGACTGGC TTGCCTGGGC GCGGGCAAAG GCCGACTGGC TGGACCCCTT GGTGCTGGTT TCTGACGTCA TTCTGGATGC GCCAGAGCCC AAGCGGCCCG GCTACTGGTA G
|
Protein sequence | MESTAKPVDR ETLYNEVWTE PVSVVAPRYG LSDVGLAKIC RSLAIPLPSR GYWAKVKAGK VMHRVPLPPL KHSGAVPTGL VKLPSEKVAV REAARKTAAR VRKEVPPLPP PEEVGTPHSL VVATSKRLRR REGWPEGTML RSAPKEVLNL SVTKDALDRA LALTDALIKA LEKEGFSFEI DAEKGATWVK WLETGTKMAF AISEHVKRSV HVVTPAEERA RKRYWDRSRW DHAASYPSIP QHDYTPTGTL TIEVGRWPSR KWNDTPRTQL ESRLGEVVGG VIVLARDIHA KEQEEARRKE AYRLAVERYE FLTTRHADEV ARFEALEADA ANWERAAKLR AFADAKERQL RAVGGPSAEQ ADWLAWARAK ADWLDPLVLV SDVILDAPEP KRPGYW
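
The relationship between the two sequence fields can be sketched with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already given on the coding strand (it starts with ATG and ends with TAG), and it uses the amino acid assignments of translation table 11, which match the standard code. Alternative start codons, which table 11 permits, are not handled because this gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence above.
prefix = "ATGGAGAGTA CGGCTAAGCC TGTTGACCGG"
print(translate(prefix))  # MESTAKPVDR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.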
|
| |