Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2130 |
Symbol | |
ID | 7085400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2405881 |
End bp | 2407077 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699149 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_002355766 |
Protein GI | 217970532 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGTGA AGACCGTCCT CGACGGCAGC GCCGTACCGG TCAGGATCTG GACCGACGAC ATCGACGAGG GCTCGAAAGC GCAGCTCGCC AACATCGCCA GCCTGGCCTT CATCCACCAT CACGTGGCTG CCATGCCCGA TGTGCATCTG GGCATCGGCG CGACCATCGG CTCGGTGATC GCCACCCATC AGGCCATCAT TCCGGCGGCG GTGGGTGTCG ACATCGGCTG CGGCATGGTC GCGGCGCGCC TGTCGCTCAC CGCCAACGAC ATCGACGAGA AGCGCCTGAA GAAGGTGTTC GATCAGATCA GCCGCGATGT GCCGGTCGGC CGCGACCAGC ATGCGGACGG CCGGGTACTG GTCGACGCGG TGCGCCCGTT CGAGCCGGGC CTCAAGGCCC TGACCGAGCG TCACCCGCAA TTGCTCAAGG CCTTCGGCAA GTTCTCCAAG TGGGCGAACC AGATGGGCAC GCTCGGGGGC GGCAACCATT TCATCGAGGT CTGCCTGGAC GAGCACGAGC AAGTCTGGGT GATGCTGCAC TCGGGCAGCC GTGGCATCGG CAACGCCATC GCGACGTACT TCATCGAGCT GGCGAGAAAG GACATGGCGC GCCACATGAT CCACCTCCCC GATCGCGATC TGGCCTACTT TGCCGAGGGC AGCGAGCACT TCGCCGATTA CGTCGAGGCT GTGCATTGGG CGCAGGAGTA CGCGATGGCC AACCGCCAGG CGATGCTCGA TCTCGTGCTC ACGGGGCTGG CGCGCCACCT GCCGCCCTTT ACCGTCACCA CCGAGGCGGT GAACTGCCAC CACAACTACG TCGCCCGCGA GCACCACTAT GGCGCCGATG TCTGGGTGAC GCGCAAGGGC GCGATTCGTG CAGGGAAGGG GGAGCTCGGC ATCGTCCCTG GCAGCATGGG CGCGCGCAGC TACATCGTGC GCGGCAAGGG CAATGCGGAG AGTTTCTGCT CCAGCGCGCA CGGCGCCGGC CGGCGCATGA GCCGCACGGC CGCGAACAAA CGCTTCACCG AGGCCGACCT CGCACGCCAG ACCGAAGGGG TGATCTGTCG CAAAGACAAG GGCGTGGTTG ATGAGATCCC CGGCGCGTAC AAGGACATCG ACGAAGTGAT GGCCAACCAG CGCGACCTCA CCGAGGTCCT GCATACCTTG AAGCAGGTGG TGTGCGTGAA GGGGTAG
|
Protein sequence | MPVKTVLDGS AVPVRIWTDD IDEGSKAQLA NIASLAFIHH HVAAMPDVHL GIGATIGSVI ATHQAIIPAA VGVDIGCGMV AARLSLTAND IDEKRLKKVF DQISRDVPVG RDQHADGRVL VDAVRPFEPG LKALTERHPQ LLKAFGKFSK WANQMGTLGG GNHFIEVCLD EHEQVWVMLH SGSRGIGNAI ATYFIELARK DMARHMIHLP DRDLAYFAEG SEHFADYVEA VHWAQEYAMA NRQAMLDLVL TGLARHLPPF TVTTEAVNCH HNYVAREHHY GADVWVTRKG AIRAGKGELG IVPGSMGARS YIVRGKGNAE SFCSSAHGAG RRMSRTAANK RFTEADLARQ TEGVICRKDK GVVDEIPGAY KDIDEVMANQ RDLTEVLHTL KQVVCVKG
|
| |