Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1800 |
Symbol | |
ID | 7085770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2021677 |
End bp | 2023326 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698822 |
Product | ErfK/YbiS/YcfS/YnhG family protein |
Protein accession | YP_002355448 |
Protein GI | 217970214 |
COG category | [S] Function unknown |
COG ID | [COG2989] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.081998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAGCA AGCCCGGTCT GGGCAAATGT TTTGTAACTG CCGCGGCCCT CGCCGCGCTG GCCCTCATGG CGTCCGGCGT GCATGCGACA GCGGCTGGCG AGGCCGCGGT GGTCGCGGTC GAGCCGGCCG CGAGCGACTC CGCGCTCGCG CTGCGCATCG AGCAGCGCCT GCGCGAGCGC GCCGCGACGG CCGACGACCC CATCGCGCTG TTCTACCTCG CGCGTGCCTA CCGCTCGGTG TGGACCGAAC CTGCGCGCGT GCACGCCCTG CTCGCGGCGG TCGAGGCGGT ACGCGGGCAT GGGCTGGATG CGGCGGATTT CGCCCCCGCC CGCCTGCGCG CCGGTGCCGT CCCCGCGGCC GATCCCGAGC GTGCGGCCGA GCGCGAGCTC TTGCTCACCG ACACCCTGGC CGCGCTGCTC TTCCAGCTCC GCCATGGCAA GGTCGACCCG CGCGCGCTCT ACCGCGAGTG GAACTTCACC CCGCCGCCCA GGCCCTACGA GCGTGCGGCC GAGCTTGCAC GCGTGCTGCA GGCGCCCGAT CTGGCCGCCG CGGTCGACGC GTACGCGCCG GACCTGCCGC TCTACCGCGC GCTGCGCGCC GAGCTGCTCG CCCAGCAAGG CCGGCTCGCC GTGGGCGACT GGCCCAAGGT CGCCGCCGGG CCCACCCTCA AGCCCGGTGC AAGCAGCTCG CGCGTGGCCT CGTTGCGCGC GCGCCTGGCC GCCGCGGGCG AGCGCGTGTC CGAGGCGCGC GACAAGTCCC ACTACGACGA AGCTCTGGTC GAGGCGGTCA AGCGCTTCCA GGCCGCGCAC GGCCTGCAGG CCGACGGCGT GCTCGGGGCG CAGACCCTGG AGGCGCTCAA CGCCAGCCCG GCGCAGCGCG TGGCGCAGAT CCGCGCCAAC CTCGAGCGCC TGCGCTGGGT GGCGAGCGAC CTGCAGGGCG ACCGCCTGCT GGTGGACATC GTCGGCTACC ACGCCGACCT CGTGCTCGAC GGCCAGCCGG TGTGGTCCTC GCGGGTGATC GTCGGCAAGC CCAAGCGGCG CACCCCCTCG CTGCTCGACA GCGTCACCCA TCTGGTGCTC AACCCGAAGT GGGTGGTGCC ACCCACCATC CTGCGCGAGG ACGTGATTCC GGGCGCAGCG CGCAACCCGT CCTATCTCGC CAACCGGCGC CTGCGCGTGG TCGATCGCAG CGGGCAGACG GTGGACCCCG CCACCATCGA CTGGAGCGGG GCGCGCCAGA GCGGTTTTCC CTATCGCGTC GAGCAGCAGT CCGGTGCCGA CGGCTCGCTC GGGCGGATCA AGTTCTCGCT CTCCAACCCC TACGTGATCT ACCTGCACGA CACCAACGCG CGCTCCCTGT TCAAGCGCGC CGAGCGTGCG CTCAGCTCGG GCTGCGTGCG CGTGGAGAAG CCCGAGGAGC TGGCGGTGCT GCTGCTCGCC GACAGCGGGC GCTGGAGCGC GCAGGCGCTG CAGGCGGCGC TCGACAGCGG GCGCACGCGC ACCGTGGACG TGGGGCGCGA CGTCAAGGTG TTGCTGCACT ACGCCACCGC GGCGCTCGAC GAGGCGGGCA GGGTGCTGCT GCGCAACGAC ATCTACGGCT ACGACGCGGC GATCGTGGCC GCGCTCGATG CGCCCGCGCC GGCGCGCTGA
|
Protein sequence | MQSKPGLGKC FVTAAALAAL ALMASGVHAT AAGEAAVVAV EPAASDSALA LRIEQRLRER AATADDPIAL FYLARAYRSV WTEPARVHAL LAAVEAVRGH GLDAADFAPA RLRAGAVPAA DPERAAEREL LLTDTLAALL FQLRHGKVDP RALYREWNFT PPPRPYERAA ELARVLQAPD LAAAVDAYAP DLPLYRALRA ELLAQQGRLA VGDWPKVAAG PTLKPGASSS RVASLRARLA AAGERVSEAR DKSHYDEALV EAVKRFQAAH GLQADGVLGA QTLEALNASP AQRVAQIRAN LERLRWVASD LQGDRLLVDI VGYHADLVLD GQPVWSSRVI VGKPKRRTPS LLDSVTHLVL NPKWVVPPTI LREDVIPGAA RNPSYLANRR LRVVDRSGQT VDPATIDWSG ARQSGFPYRV EQQSGADGSL GRIKFSLSNP YVIYLHDTNA RSLFKRAERA LSSGCVRVEK PEELAVLLLA DSGRWSAQAL QAALDSGRTR TVDVGRDVKV LLHYATAALD EAGRVLLRND IYGYDAAIVA ALDAPAPAR
|
| |