Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1624 |
Symbol | |
ID | 7084834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1818977 |
End bp | 1820086 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698644 |
Product | molybdopterin biosynthesis protein MoeY |
Protein accession | YP_002355275 |
Protein GI | 217970041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00849574 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAC AAGGCACAAT GGAGCGGCTG CTCGAGCGTG CGCGCTGGGC GCCGAGCGGT GACAACACCC AGCCTTGGCG CTTCGAGGTG CTCGGTGTGG ATCGGCTTGC GATCCATGGT TTCGATACGC GCGCCGAGGT GGTCTACGAC TTTGACGGGC GGGCGAGCCA GCTTGCGCAT GGTGCCTTGC TCGAGACCCT GCGCCTGGCC GCCTCGGCCC GCGGCTTGCG CGCCGAATGG ACGATTCGCC CCGCTTGCAG CGAAGAGGCA CCGATCTACG ACGTCGTGCT CGTCCCCGAT GCGACTCTAC GCCCCGATCC GCTAGAGGCC TTCATCGAAA GTCGTGTCGT GCAGCGCCGG CCGATGCGCG TCACCGCGCT CGGTCCGGCC GAGCGCGAGG CACTTGCCGC AGCGGTCGGC AGCGATTACC GGCTGAGGTT CTTTGAAGGT TTCGATGGCC GCTGGCAGGT GGCGAGGCTC TTGTGGCACA GCGCGCGCAT CCGCCTGACC ATGCCCGAGG CCCTGAACGT GCATCGCAGC ATCATCGAAT GGCGCGCGCG CTTCAGCGAG GACCGCATTC CCGAGCAAGC GGTGGGGGTC GATCCCCTCA CCGCGCGTCT CATGCGCTGG GTGATGCAGA GCTGGAGCAG GGTGGAGTTC TTCAATCGCT ACCTGCTCGG CACTGTGGCG CCGCGCGTTC AGCTCGATCT GCTGCCGGCA CTCGGGTGTG CGGCGCATGT GCTGATGACC CCGGTGCGTC CGCTGCGCAC GCTCGGCGAT TTTGTCGCCG CCGGCGCGGC GATGCAGCGT CTGTGGCTGG CGGTCGAGGC CGCGGGCATG CACCTCCAGC CCGAGATGAC GCCGGTGATT TTCCGCTGGT ATGTCCAGGC GGGGCGTTCG CTGTCGCCCC ATCCCGGCAT GGATGCTGCG GCTGGCGCGC TCGCACGCCG CTTCGAGGCG CTTGCCGATG CCGGACCGGC GGATGGTTTT GCCTTCTTTT GCAGGGTGGG GTACTCGGCG CTGCCGCATT CGCGTTCGCT GCGCAAGGCG CTCGGTTCGC TGCTCGTCGG GGAAGGGGGC GCTTCCGATG GGAGCGGACG AGAGGTGTGA
|
Protein sequence | MIEQGTMERL LERARWAPSG DNTQPWRFEV LGVDRLAIHG FDTRAEVVYD FDGRASQLAH GALLETLRLA ASARGLRAEW TIRPACSEEA PIYDVVLVPD ATLRPDPLEA FIESRVVQRR PMRVTALGPA EREALAAAVG SDYRLRFFEG FDGRWQVARL LWHSARIRLT MPEALNVHRS IIEWRARFSE DRIPEQAVGV DPLTARLMRW VMQSWSRVEF FNRYLLGTVA PRVQLDLLPA LGCAAHVLMT PVRPLRTLGD FVAAGAAMQR LWLAVEAAGM HLQPEMTPVI FRWYVQAGRS LSPHPGMDAA AGALARRFEA LADAGPADGF AFFCRVGYSA LPHSRSLRKA LGSLLVGEGG ASDGSGREV
|
| |