Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1976 |
Symbol | |
ID | 7085487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2230409 |
End bp | 2231875 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699001 |
Product | protein of unknown function DUF112 transmembrane |
Protein accession | YP_002355623 |
Protein GI | 217970389 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGGT TCGAGGTCGC GCTGTCGCCG GGCAATCTCT GGTTCTGCCT GATCGGGGTG GCGCTCGGAA CCCTGGTCGG CGTGTTGCCC GGGGTCAGCT CGCTGGCCGC GGTCGCAATG CTGTTGCCAT TCACGCATGC CTTGTCGCCA ACCGCGGCGC TGGTGCTGTT GGCGGGGGTC TATTACGGCT CCGAGTACGG CGGGGCGATC GCGGCGATCC TGCTCAACGT GCCGGGAACG CCCGCTTCCG CGGTTTCATG TCTCGACGGT CACCCGATGG CGCGCGCGGG GCGTGCCCGC GAGGCGCTGA TCCTGAGTGC GGGAGCGTCC TTCGGCGGCG CGGTCGTGGG CATCGGCATG ATGCTCGCCA TGGCCGCCCC GCTCGCCGGG CTCGCGTTCA TGCTCGGGCC GGCCGAATAC TTCGCGATCA TGATCTTCGG ACTGGTCTGC ACGGCCGCCG TGGGCCGCGG GCGTCTGGCC GCGGGCATGG TGGCGATGCT CGCCGGGGTC GGCCTCGCGA TGGTCGGTAC CGACGCCCAG ACCGGCGTGG CCCGTTTCAC CCTCGGGCTG ACGGAACTGC GCGACGGCAT TCCGCTGGTG GTGCTCGCCA TGGGGCTCTT CGGCGTTTCG GAGGTGATGC TGTCGATGTT CGGCGCCTTG CCTGTGGTGC GTGGCGGGAT TGGCGAGCGC AGCGCGCTCC CGCGCTGGAA GGCGTGGCGC CGTGCAGCAG GGCCGGTGCT GCGCGGGGGC GTGCTGGGCG GATTGTTTGG TGCCTTGCCG GGCACCGGAC CCACCCTGGC GTCGATCGCC GCGTATGCGC TCGAGCGTCG TCTTTCTGCT CGCCCGGAGT GCTTCGGCCG CGGCGCGATC GCCGGGCTGG CGGCCCCCGA GGCGGCCAAC AACGCGGCGG CGCAGACGGC TTTCGTGCCC ACGCTGGCGC TGGGCATCCC CGGCAGCGCA ACCATGGCTC TCATGCTCGG CGCGATGTCG GCCCATGGGG TGGTGCCCAG CCCGCTGCTG GCGGTCGAGA ATCCGGAGCT CTTCTGGGGG CTGATGGCGA GCTTCGTGAT CGGCAACCTG CTGCTGCTCG TGCTCAACGT TCCGCTGGTG GGGGTCTGGG TTCGCATCCT GCGGCTGCCG CCCCAGCGAC TCTATCCACT CGTCCTCGTG CTGATCGCGG TGGCGACCTT CGGTGTGCGC GGCAGTGCCT TCGACGTCTG GGCGGCCCTC GCCATCGGCC TGGTGGCCTA CGGGATGAGG CGATCGGGGG TGCATCTGGC ACCCGTGCTC ATCGGCTTCG TCCTCGGGCC GCTGGTCGAG GACAACTTCC GCCGCGCGCT GGCAATTTCG CAGGGCGACT TTGCGATCTT CGTCTCCAGC CCGATCGCCC TCGCGGCGCT CGGCGCGGTC TTCGTGCTGC TCGCCGTCGC CGTCTGCCGG TCGATGCGCA GCGCAAGGGC CGCGTAG
|
Protein sequence | MAGFEVALSP GNLWFCLIGV ALGTLVGVLP GVSSLAAVAM LLPFTHALSP TAALVLLAGV YYGSEYGGAI AAILLNVPGT PASAVSCLDG HPMARAGRAR EALILSAGAS FGGAVVGIGM MLAMAAPLAG LAFMLGPAEY FAIMIFGLVC TAAVGRGRLA AGMVAMLAGV GLAMVGTDAQ TGVARFTLGL TELRDGIPLV VLAMGLFGVS EVMLSMFGAL PVVRGGIGER SALPRWKAWR RAAGPVLRGG VLGGLFGALP GTGPTLASIA AYALERRLSA RPECFGRGAI AGLAAPEAAN NAAAQTAFVP TLALGIPGSA TMALMLGAMS AHGVVPSPLL AVENPELFWG LMASFVIGNL LLLVLNVPLV GVWVRILRLP PQRLYPLVLV LIAVATFGVR GSAFDVWAAL AIGLVAYGMR RSGVHLAPVL IGFVLGPLVE DNFRRALAIS QGDFAIFVSS PIALAALGAV FVLLAVAVCR SMRSARAA
|
| |