Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2973 |
Symbol | |
ID | 7874363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3219830 |
End bp | 3221197 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699894 |
Product | protein of unknown function DUF1329 |
Protein accession | YP_002889949 |
Protein GI | 237653635 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.96658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTCT GCAAGCTGAC TGCGCTCGCC GGCGCCCTGC TGGCCAGCCA CGCCGCCCTG GCCGCCGACG CGGCGAGCCT GGGCACCACG CTGACCCCGC TCGGCGGCGA ACGCGCGGGC AACGCCGACG GCAGCATCCC GGCCTGGACG GAAGGCTCGG CGCTCGAGCC GGGCTGGAGC TACGGCAAGC CGCGCGCCGA CTACTTCAAG CACAAGGGCG ACAAGCCGCT GTTTACGATC GATGCCGCCA ACGTCGACAA ATACGCCGGC AAGCTCACCG AGGGCCAGGT CGCGGTGATC AAGGGCTTCA AGAACTACAA GCTCGAGGTC TATCCGAGCC GCCGCTACTG CTCCGCGCCC GACTTCGTGC AGGCCAACAC CAAGGCCAAC GTGGGCATGG CGAAGATCGG CGCCGACGGC TGGAGCCTGG CCGAGGCCAC CGTGCCCGGC ATTCCCTTCC CGCTGCCGCA GAGCGGCATC GAGGTGCTGT GGAACGCCAA GATGAAGTAC GCCGGCGTCG CCCTCGACAT GAAGGCGCTG TGGGTGATGC TGTCGCCGCG CAGCGGCGGC AGCGACTGGA TCGAGGCCGG GTCCACGCAG ACCTATTACT ACCCCTGGGG CAAGAAAGGC TCGAACAAGC TCAGCGAGCT GCCGCCGGTC GAGTACCACA CCTACTTCAA CTACACCTCG CCCACCGCAC TGGCCGGCCA GGCGCTGGTG ATCACCTCGT ACCTGAACAA GACCAGCGAC ACCTTCTACT ACTTCCCCGG CCAGCGCCGC GTGCGCCGCA TGCCGAGCTA CTCCTACGAC GCCCCCCAGG TCGGCTTCGA GAACCAGTAC ACGCTCGATG AGCCGCGCGT CTTCAACGGC ACGCCCGACC GCTTCGACTG GAAGCTGGTG GGCAAGAAGG AAATGTTCAT CGGCTACGGC AGCTTCGGCA TGTACGACCC CGCTGCCGAC CGCCGCAAGG TGGTGACGCC CGACGGCGTG GATCCGAAGG CGACGCGCTA CGAGCTGCAC CGCGTCTGGG TGGTCGAGGC CACCGCCAAG GACGGTGTGC GCCACGTCGC GCCCAAGCGC CGCTTCTACT TCGACGAGGA CTCCTGGGCG CTGATGGGCG CGGAGGACTA CGACGCCCAG GGCAAGCTGT GGAAGGTCCG CGAGAGCTTC CTGATTCCGG TCGCCGAAAC GGGCGCCTGC GACAACCCGG CCTTCGTGCA GTACGACCTC GTCTCCGGCC GTGTGCTCTT CGACCAGGCG GGCATGGGTG CCGGCAAGGA CATGGTCTGG GCGGTCGAGG CCGACGATCC GAAGTACAAG GACGCCTTCT ACACCCCGGA CAACCTGCGC GCGATCAGCG ACCGCTGA
|
Protein sequence | MRFCKLTALA GALLASHAAL AADAASLGTT LTPLGGERAG NADGSIPAWT EGSALEPGWS YGKPRADYFK HKGDKPLFTI DAANVDKYAG KLTEGQVAVI KGFKNYKLEV YPSRRYCSAP DFVQANTKAN VGMAKIGADG WSLAEATVPG IPFPLPQSGI EVLWNAKMKY AGVALDMKAL WVMLSPRSGG SDWIEAGSTQ TYYYPWGKKG SNKLSELPPV EYHTYFNYTS PTALAGQALV ITSYLNKTSD TFYYFPGQRR VRRMPSYSYD APQVGFENQY TLDEPRVFNG TPDRFDWKLV GKKEMFIGYG SFGMYDPAAD RRKVVTPDGV DPKATRYELH RVWVVEATAK DGVRHVAPKR RFYFDEDSWA LMGAEDYDAQ GKLWKVRESF LIPVAETGAC DNPAFVQYDL VSGRVLFDQA GMGAGKDMVW AVEADDPKYK DAFYTPDNLR AISDR
|
| |