Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1711 |
Symbol | |
ID | 7084131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1925725 |
End bp | 1926921 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698732 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_002355362 |
Protein GI | 217970128 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTGA AGACCGTCCT GAACGGCAGC ACCGTGCCGG TCAGGATCTG GACCGACGAC ATCGACGAGG GCTCGAAGGC GCAGCTCGCC AACCTCGCCA GCCTGCCCTT CATCCACCAC CACGTCGCCG CCATGCCCGA CGTGCATCTG GGCATCGGCG CGACCATCGG CTCGGTGATC GCCACCCACC AGGCGATCAT CCCGGCGGCG GTGGGCGTCG ACATCGGCTG TGGCATGGTC GCGGCGCGCC TGTCGCTCAC GGCGAATGAC ATCGACGAGA AGCGGCTGAA GAAGGTGTTC GATCAGATCA CGCGCGACGT GCCCGTCGGC CGTGACCAGC ATGCCGACGG TCGCGTGCTC GTCGATGCGG TGCGCCCCTT CGAGCCCGGC CTCAAGGCCC TGACCGATCG TCACCCGCAG CTGCTCAAGG CCTTTGGCAG GTTCTCCAAG TGGGCCAACC AGATGGGCAC GCTCGGCGGT GGCAACCACT TCATCGAGGT CTGCCTGGAT GAGAAGCGCC GGGTGTGGGT GATGCTGCAC TCGGGCAGCC GGGGCATCGG CAACGCGATC GCGACCTACT TCATCGAGCT CGCCAGGAAG GACATGGAAC GGCACATGAT CCACTTGCCC GACCGCGACC TGGCCTACTT CCGCGAGGGC AGCCCGCATT TCGACGACTA CGTCGAGGCG GTGCACTGGG CGCAGGACTA TGCGATGGCC AACCGCCAGG CGATGCTCGA GCTCGTGCTC GCAGGGCTGG CGCGCCACCT GCCGCCCTTT ACCGTGACTA CCGAGGCGGT GAACTGCCAC CACAACTACG TCGCCCGCGA GCACCACTTC GGCGCCGACG TGTGGGTGAC CAGAAAGGGC GCGATCCGCG CGGGGGAGGG CGAGCTCGGC ATCGTCCCCG GCAGCATGGG CGCGCGCAGC TACATCGTGC GCGGCAGGGG GAACGCGGAG AGCTTCTGTT CCAGTGCGCA CGGCGCTGGC CGGCGTATGA GCCGCACGGC GGCCACCAAG CACTTCACCG AGGCCGACCT TGCGCGCCAG ACCGAGGGCG TGATCTGCCG CAAGGACAAG GGCGTGGTGG ACGAGATCCC CGGCGCGTAC AAGGACATCG ACACCGTGAT GGCCAACCAG TCGGACCTGA CCGAGGTGCT GCACACGCTG AAGCAGGTGG TGTGCGTGAA AGGGTAG
|
Protein sequence | MPVKTVLNGS TVPVRIWTDD IDEGSKAQLA NLASLPFIHH HVAAMPDVHL GIGATIGSVI ATHQAIIPAA VGVDIGCGMV AARLSLTAND IDEKRLKKVF DQITRDVPVG RDQHADGRVL VDAVRPFEPG LKALTDRHPQ LLKAFGRFSK WANQMGTLGG GNHFIEVCLD EKRRVWVMLH SGSRGIGNAI ATYFIELARK DMERHMIHLP DRDLAYFREG SPHFDDYVEA VHWAQDYAMA NRQAMLELVL AGLARHLPPF TVTTEAVNCH HNYVAREHHF GADVWVTRKG AIRAGEGELG IVPGSMGARS YIVRGRGNAE SFCSSAHGAG RRMSRTAATK HFTEADLARQ TEGVICRKDK GVVDEIPGAY KDIDTVMANQ SDLTEVLHTL KQVVCVKG
|
| |