Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1439 |
Symbol | |
ID | 7083522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1603761 |
End bp | 1605239 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643698457 |
Product | protein of unknown function DUF513 hemX |
Protein accession | YP_002355094 |
Protein GI | 217969860 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACAAG AAAATCCCGC TCTGACCCAG CCGCCGCCGG CGACGTCCGC ATCGCCGGCC CCGGAGGAGA CCCGACCCGT CATCGAGGAG GTGGGCGTGC CCGAGGCACC GCCCGCAGCC GAGGTGGATC CCGCGGCGCG CGGCGCTGCG AGCGAGGGCG ATCCGGTCGC CCCCGCCGCA CGAACGGCGA ACGCCGCCGA GGAGCAGCCC GCTCGCAGTC CCGCCGCGTG GTGGGCGCTG CTGCTGGCGC TGATCGCCGT CGGCGTGGCC GGCTGGTCGA TCTGGCAGGC TCGCGAGATG CGCCTGCAGA CGGGACAGCT GCGCGCGGAG GTCGCCAGCC GTCTCTCCGA TGGCGAGACC ATCGCCACCG AGGCGCGCGG GATGATGCGC CAGCAGCAGG AGGTCATCGC CTCGCTGCAG GGCAAGCTCG GTGCGCTCGA ATCCAGGGTC GAGACCACAC AGGGCCAGGC CGAGGCGCTC GAGGCGCTTT ACCAGGAGTT CTCGCGCAGC CGCGAGGACG GCGTGCTCGC CGAAGTCGAG CAGGCCGTCG CGCTGGCCTC TCAGCAGCTG CAGATCGCGG GCAACGTCGA GGCCGCGCTG ATCGGCCTGC AGGAGGCCGA GGCCCGGTTG GCGATCCACG ATCGCGGCCA GCTCGCCACC CTGCGCCGTG CGCTCGTGCG CGACATCGAG GAGCTGCGCC TGCTGCCGGT GCTGGATGTT TCCGGCCTCA GCCTGCGCCT GGAGCTGATG CTCGAGCGTG CGGACACGCT GCCGCTCGCC TTCGAGAGCC CGCTGCCGGC TGCCGCTGCG GTGGGCGCGG AGATGGGGCC TGCCGATGGC GGTGGCTTCG TGGGCTGGAT GGCCGGTGTG TGGCGCTTCG CGCAGAACGT CGCGGCCGAT GCCTGGAGCG AGATCCGCAC CCTGATCCGC GTCGAGCGCC TCGACCAGGA AGATCCGGTG CTGCTCGCGC CCGAGCAGAA CACCTTCCTG CGCGAGAACC TCAAGATGCG CCTGCTCACC GCGCGCCTCG CGCTGATGGC GCGCGACGGC CGCAGCTACG CAGCCGATCT TGCCCAGGCC CGCCAGTGGT TGGAGCGTTT CTACGACCTG CGCGACGAGC GGGTGCAGGC CGCGCTCGGC GAGTTGAAGC AACTCGAGGC GGTGAAGGTG CGCTACGCGC CGCCCGATCT GAGCGAGACC TTCAGCGCCT TGCGCAGCGT GCAGTCGCGC GCTGGCCGCA GCGGCGCGGA TGCGCGCGGC GCTGCCGCGC CTGCGCAGGC AGCGCCTGCA GCCGCTGCCG CGCCCGCTCC TGCGTCCGCC CCTGCCGAAG CGCCCGTGGC GACTGCCGAG ACCCCGGCAT CCGCAGCCCA GGGAGCGGCG AACCAGCCTG CCGCTGCCGA GGCGCAGCCT GCTGCGCCGG CCGAGCAGGC GCCTGCGGGC TCCGTGACCG ACGCTGCCGC CCCGGCGGCG AGCCAGTGA
|
Protein sequence | MRQENPALTQ PPPATSASPA PEETRPVIEE VGVPEAPPAA EVDPAARGAA SEGDPVAPAA RTANAAEEQP ARSPAAWWAL LLALIAVGVA GWSIWQAREM RLQTGQLRAE VASRLSDGET IATEARGMMR QQQEVIASLQ GKLGALESRV ETTQGQAEAL EALYQEFSRS REDGVLAEVE QAVALASQQL QIAGNVEAAL IGLQEAEARL AIHDRGQLAT LRRALVRDIE ELRLLPVLDV SGLSLRLELM LERADTLPLA FESPLPAAAA VGAEMGPADG GGFVGWMAGV WRFAQNVAAD AWSEIRTLIR VERLDQEDPV LLAPEQNTFL RENLKMRLLT ARLALMARDG RSYAADLAQA RQWLERFYDL RDERVQAALG ELKQLEAVKV RYAPPDLSET FSALRSVQSR AGRSGADARG AAAPAQAAPA AAAAPAPASA PAEAPVATAE TPASAAQGAA NQPAAAEAQP AAPAEQAPAG SVTDAAAPAA SQ
|
| |