Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0133 |
Symbol | |
ID | 7085231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 158011 |
End bp | 159198 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697177 |
Product | hypothetical protein |
Protein accession | YP_002353826 |
Protein GI | 217968592 |
COG category | [S] Function unknown |
COG ID | [COG4394] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCGAT TCCGGTCGCC CGCCCGTCCC CGCTGGGAGA TCTTCTGTCA GGTTGTGGAT AATTTTGGGG ATATCGGCGT GTGCTGGCGG CTGGCGACCG ATCTGGCACG CCGTGGAGGT GGAGCGGTGC GGCTCTGGGT GGATGAATGG GAGGCCTTGC GGCGCATCTG TCCGGCCGCC GCCGCGGTGG ATGAGCGGGC GGGCGGAAGC GTGCAAGGGG TGGAGTTGCG GCGCTGGACG AGCGCGTTCG TTCCGCTCGT GCCGGGCGAG ATCGTCGTCG AGGCCTTCGC CTGCGAGCTG CCGGCCGCCC AGCTCGAGGC CATGGCGGCG CTGCGGCAGG CGCCGTTGTG GATCAACCTC GAGTACCTGT CGGCCGAGGC CTGGGTGGCG GGCTGTCACG GCATGGCCTC GCCGCATCCG CGCCTGCCGC TGGTGAAGCA TTTCTTTTTT CCGGGCTTCG ATGCGGGCAC TGGCGGATTG CTGCGCGAGG CCGACCTGCT CGCGCGCCGC GACGCCTGGC GAGCCCGTCC CGAGGGACGC GCCGCCTGGC TGGCCGCGCG CGGCATCGCG AGCGGCCCGG ATGCGCTGCG GGTGTCGCTG TTCGCCTACG AGCAGCCGGA GCTCCCCGCC CTGTTCGACG CCTGGGCCGG CAGCGGTCGC GAGATGCTGG TGCTGGTGCC CGAGTCGCGC GTGCTCGGCG ACGTGCAACG TGCGCTCGGT CGCGAGCGCC TGCAGGCGGG CGATCGGGTG GTGCGCGGCG CGCTCACCGC CTGCGTGCTG CCCTTCACCG ACCAGGCGGG CTACGACGAG CTGCTGTGGG CCTGCGAGCT GAATTTCGTG CGCGGCGAGG ACTCCTTCGT GCGCGCGCAG TGGGCGGCGC GGCCCTTCGT GTGGCAGATC TATCCGCAGG CGGATGCCGC CCATCACGAC AAGCTCGAGG CTTTCCTGGC GCGCTACCTG CAGGGGCTGC CGGCGGCCGA GGCGGATGCG CTCGCCCGCT TCTGGCGCGC CTGGAACGGC TGCGCGCCGG ATCGCGCGAC GCCGGCCGAG GCCTGGCCGG CCCTCGCCGC GGCGCTGCCC GGGCTCGATG TCCATGCTCG CCGCTGGTGC GATGTGCAGG CGGAGCTGCC CGATCTCGCG ACCGCACTCG ATACATTCTG TTCCCACATC GCGCACGCCG CGCGGTAG
|
Protein sequence | MHRFRSPARP RWEIFCQVVD NFGDIGVCWR LATDLARRGG GAVRLWVDEW EALRRICPAA AAVDERAGGS VQGVELRRWT SAFVPLVPGE IVVEAFACEL PAAQLEAMAA LRQAPLWINL EYLSAEAWVA GCHGMASPHP RLPLVKHFFF PGFDAGTGGL LREADLLARR DAWRARPEGR AAWLAARGIA SGPDALRVSL FAYEQPELPA LFDAWAGSGR EMLVLVPESR VLGDVQRALG RERLQAGDRV VRGALTACVL PFTDQAGYDE LLWACELNFV RGEDSFVRAQ WAARPFVWQI YPQADAAHHD KLEAFLARYL QGLPAAEADA LARFWRAWNG CAPDRATPAE AWPALAAALP GLDVHARRWC DVQAELPDLA TALDTFCSHI AHAAR
|
| |