Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3926 |
Symbol | |
ID | 7873572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4321989 |
End bp | 4323050 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700863 |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_002890886 |
Protein GI | 237654572 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0309] Hydrogenase maturation factor |
TIGRFAM ID | [TIGR02124] hydrogenase expression/formation protein HypE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.224134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG TCAAAACCGG CTACGTCCGC CCCCTGGACC TGCGCAAGGG CCGCGTCGAC CTCGGCTTCG GCGCCGGCGG GCGGGCGATG GCGCAATTGA TCTCCGAACT TTTCCTGCGC GCGTTCGCCA ACGACTGGCT CGCCCGCGGC GACGACGGCG CGGTGCTGCC CGCTCCCGCA GCAGGCGAGC GCCTGGTGAT GGCGACCGAC GCCCACGTGG TGAGCCCGCT GTTCTTCCCC GGCGGCGACA TCGGCAGCCT GTCGGTGCAT GGGACGGTGA ACGACCTCGC GGTGATGGGC GCACGCCCGC TGTACCTCGC CGCCAGCTTC ATCCTGGAAG AAGGCTATGC GCTCGCCGAC CTCGCCCGCA TCGTCGAATC GATGGCCTCC GCGGCGCGCG CGGCCGGGGT GGCGGTGGTG ACCGGCGACA CCAAGGTGGT CGAACAGGGC AAGGGCGACG GCGTGTTCAT CACCACCACC GGGGTGGGGG CGCTGCCGGC CGGGCGCGAT CCGGGCGGCG CACGGGCGCG GCCGGGCGAC GTGGTGCTGG TGTCGGGGCG CATCGGCGAC CATGGCATGG CAATCATGGC GCAGCGCGAG TCGCTCGCCT TCGACTCCGA GATCGTCTCC GACAGCGCGG CGCTGCACGG CCTGGTCGAG GCGCTCTACG CCGCGGTGCC GGCCGAGGCG GTCCGCGTGC TGCGCGACCC CACGCGCGGC GGACTGGCGA CCACCTTGAA CGAGATCGCC GCGCAGTCGG GCGTGGGCAT GGAGCTCGAC GAGGCGGAGA TCCCGGTGTC GGCGCAGGTG CAGGCTGCCT GCGAGCTGCT CGGGCTCGAC CCGCTCTACG TCGCCAACGA GGGCAAGCTG GTGGTGCTGT GCGCTCCGGA GCACGCCGGC GCGGCGCTCG CCGCGCTGCG CGCGCATCCG CTCGGCACCG AGGCGGCGCG GATCGGATGC GCCACCGCCG ACCCGCAGCA CTTCGTGCAG CTGCGCACCG GCCTGGGCGG GCGGCGCATG GTGGACTGGA TCGCCGGCGA GCAGCTGCCG CGGATCTGTT GA
|
Protein sequence | MNTVKTGYVR PLDLRKGRVD LGFGAGGRAM AQLISELFLR AFANDWLARG DDGAVLPAPA AGERLVMATD AHVVSPLFFP GGDIGSLSVH GTVNDLAVMG ARPLYLAASF ILEEGYALAD LARIVESMAS AARAAGVAVV TGDTKVVEQG KGDGVFITTT GVGALPAGRD PGGARARPGD VVLVSGRIGD HGMAIMAQRE SLAFDSEIVS DSAALHGLVE ALYAAVPAEA VRVLRDPTRG GLATTLNEIA AQSGVGMELD EAEIPVSAQV QAACELLGLD PLYVANEGKL VVLCAPEHAG AALAALRAHP LGTEAARIGC ATADPQHFVQ LRTGLGGRRM VDWIAGEQLP RIC
|
| |