Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2759 |
Symbol | |
ID | 7873499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2987812 |
End bp | 2988918 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699681 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002889736 |
Protein GI | 237653422 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.163255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCTC CCCGCGTCGA TCGCCTGCAA ACCGTCGCCT GGACCGCGAC CGGCGCCGCC CTCGTCGCCC TGCTCTGGCT GCTCGGACCC ATCCTCACCC CCTTCGTGGT AGGGGCGGTG TTCGCCTACA TCTGCGACCC GGCGGTCAAC TGGATGGTCG CGCGCCGCGT GCCGCGGGCA CTGGCGGTGC TGCTGGTGAT CCTCGCGCTC GGCCTGCTGC TGATCGCGCT CGCGCTGATC CTGGTACCGA TGGTCTATCG CGAGGGCGTG CTGCTGGTGC GCCGCCTGCC CGAGCTGGTG CAGATGTTCA ACCTCAACGT CGCGCCGCTG CTCGAGGCCC GCCTCGGCGT AGACATCAGG CTCAACGCCG AACAGTTCCA GCAGCTGATC GCCGACAACT GGACGAGCGC GCAGGAACTG GTGCCGGCGG TGCTCGCCCA CCTCAAGACC GGCGGCATGG CGGTGCTCGG CTTCCTCGCC AACGTGGTGC TGATTCCGCT GGTGATGTTC TACCTGCTGC AGGAGTGGCC GCGCATCCTC GACGAGCTCG AGCGCATCGT GCCGCGCCCC TGGGTCGACG GCACCAAGCG CGTCCTCGGC GACATCGACT CGGTGATGTC CGAGTTCCTG CGCGGCCAGC TCTCGGTGAT GCTGCTGCTG GCGGTGTTCT ACAGCGCCGG CCTGTGGCTG GCCGGGCTCA ACTTCTGGCT GCCGGTGGGC GTGCTCACCG GCCTGCTGGT CTTCATCCCC TACGTCGGCT TCGGCGGCGG GCTGATCCTC GCCATCGTCG CCGCGCTGCT GCAGGCGCAG GGCTGGCCGC CGCTGGCCGG CGTGGCAATC GTGTATGCGC TTGGTCAGGT CGTCGAGAGC TTCGTGCTCA CGCCCTACCT GGTGGGCGAA CGCATCGGCC TGCACCCGCT CGCGGTGATC TTCGCGCTGA TGGCCTTCGG CCAGCTCTTC GGCTTCGTCG GCGTGCTGGT CGCGCTGCCG GTGAGCGCCG CGCTGCTGGT AGGCCTGCGC GAGGTGCGCG AGGCCTGGCT CGCCAGCCCG GTGTATCTCG GCACGCAGCC GCGGCCGATC ATCGCGAGCG AGCGCGAGCG CCCATGA
|
Protein sequence | MKPPRVDRLQ TVAWTATGAA LVALLWLLGP ILTPFVVGAV FAYICDPAVN WMVARRVPRA LAVLLVILAL GLLLIALALI LVPMVYREGV LLVRRLPELV QMFNLNVAPL LEARLGVDIR LNAEQFQQLI ADNWTSAQEL VPAVLAHLKT GGMAVLGFLA NVVLIPLVMF YLLQEWPRIL DELERIVPRP WVDGTKRVLG DIDSVMSEFL RGQLSVMLLL AVFYSAGLWL AGLNFWLPVG VLTGLLVFIP YVGFGGGLIL AIVAALLQAQ GWPPLAGVAI VYALGQVVES FVLTPYLVGE RIGLHPLAVI FALMAFGQLF GFVGVLVALP VSAALLVGLR EVREAWLASP VYLGTQPRPI IASERERP
|
| |