Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1911 |
Symbol | |
ID | 7085680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2156431 |
End bp | 2157621 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698936 |
Product | PUA domain containing protein |
Protein accession | YP_002355558 |
Protein GI | 217970324 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAAC TCGTCCTCCA CCCCGGCAAG GAACGCTCGC TGTTCCGCCG CCATCCCTGG ATCTTCGCCG GCTCGGTGGA CCGGCTCGAG GGCCGCGCGC GGCCGGGCGA CACGGTGACC GTGGTGACGG CCGAGGGCAA GGCGCTGGCG CGGGCGGCGT GGTCGCCGTC CTCGCAGATC CGCGCGCGGG TGTGGAGCTT CGATGCCGAG GCGGCGATCG ACCACGCCTT CTTCAAGCGC GCGGTGGCGG CCTCGGTGGC GCGCCGCGCG GCGATCCCGG CGCTGCGCGG GCAGGAGGGT GTGCGCCTGA TCCACGGCGA GTCCGACGGT CTGCCGGGCG TGATTGCCGA CCGCTACGGC GAGGTGGTGG TGCTGCAGCT CACCAGTGCG GGCGCGGACA AGTGGCGCGA GGCCATCGTC GCCGGGCTGG TGCAGGCCAC CGGCTGCGCG GCGGTGTACG AGCGTTCCGA CTCCGAGGTG CGCGGGCTGG AAGGTCTGGA GTCGCGCACC GGCTGCGCGC ATGGTGCGCT GCCGGCGGGC GGGCTCACCA TCGTCGAGAA CGGCGTGCGC ATGGAGGTGG ACGCCGAGGG CGGGCACAAG ACCGGCTTCT ATCTCGACCA GCGCGACAAC CGCCTGCTCA CCGGCCAGCT CGCGGGCGGG CGCACGGTGC TCAACTGCTT CTGCTACACC GGCGGCTTCT CGCTGCAGGC GCTCGCCGGC GGGGCGGCCT CGGTGCTGTC GATCGACTCG TCCGGCCCGG CGCTGGCGTC GGCGCGGCGC AACCTCGCGC TCAACCCGCA ACTCGACGCC GAGCGCGCCG AGTGGCTGGA GGCGGACGTG TTCAAGGCCC TGCGCGCGCT GAAGGACGAG GGCCGCAAGT TCGACCTGAT CGTGCTCGAC CCGCCCAAGT TCGCGCCCTC GGCCGCGCAC GCCGAGCGCG CCGCGCGCGC CTACAAGGAC ATCAACCTGT TCGGCTTCCG CCTGCTGAAC CCGGGCGGCA TCCTGATGAC CTATTCGTGC TCGGGCGGCA TCGGGCAGGA GCTGTTCCAG AAGATCGTCG CCGGCGCGGC CATCGACGCC GGGGTGGATG CGCGCATCCT GTACCGCCTG TCGGCCGCGC CGGACCATCC GATCGGGCTG GCCGTGCCGG AGGGCGAATA TCTCAAGGGG CTGGCCTGCC AGGTGGGGTG A
|
Protein sequence | MAQLVLHPGK ERSLFRRHPW IFAGSVDRLE GRARPGDTVT VVTAEGKALA RAAWSPSSQI RARVWSFDAE AAIDHAFFKR AVAASVARRA AIPALRGQEG VRLIHGESDG LPGVIADRYG EVVVLQLTSA GADKWREAIV AGLVQATGCA AVYERSDSEV RGLEGLESRT GCAHGALPAG GLTIVENGVR MEVDAEGGHK TGFYLDQRDN RLLTGQLAGG RTVLNCFCYT GGFSLQALAG GAASVLSIDS SGPALASARR NLALNPQLDA ERAEWLEADV FKALRALKDE GRKFDLIVLD PPKFAPSAAH AERAARAYKD INLFGFRLLN PGGILMTYSC SGGIGQELFQ KIVAGAAIDA GVDARILYRL SAAPDHPIGL AVPEGEYLKG LACQVG
|
| |