Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2206 |
Symbol | |
ID | 7085559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2487537 |
End bp | 2488700 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643699226 |
Product | Cysteine desulfurase |
Protein accession | YP_002355842 |
Protein GI | 217970608 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0240111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTCG CTCCCATCTA CCTCGACTGG AACGCGACCA CGCCGCTCGA TCCGGCGGTG CGCGAGGCCA TGCTGCCCTG GCTCGGCGCC GCCGAGCCGG CGCGCTTCGG CAACGCCTCC AGCCGCCACG AATACGGCCG CCAGGCGCGT GCCGCGGTGG ACGAGGCGCG CGCGCGGGTG GCCGCGGCGG TCGGCGCGCA CGCGACCGAG GTGATCTTCA CCAGCGGCGG CTCGGAAGCC AACAATCTCT TCCTCAAGGG CGCCGCGCCC AACCGCGAGC CGGGCGTGGT GGCGGTGAGT GCGATCGAGC ACCCCTGCGT GCGCGAGCCG GCCCGCCAGC TGCGGCGCGC GGGCTGGACG CTGCGCGAGA TCGCGGTCGA TGCGCAGGGC GTGATCGACC CCGCCGACTG GAGCGCGGTG CTGGAAGCCC GCCCGGGCCT GGTGTCGGTG ATGCTGGCCA ACAACGAGAC CGGCGTGCTG CAGGACGTCG CCGCACTGGC GCGCGCGGCG CGCGCCGCCG GCGCCTGGTT CCACACCGAC GCGGTGCAGG CGCTGGGCAA GGTGGCGGTG GATTTCCGCG CGCTCGGCGT GCATGCGATG ACGCTGTCCG CGCACAAGAT CGGCGGTCCG CTCGGCGCGG GTGCGCTGGT GCTGGACAAG CGCGTCGAGC TCGCGCCGCT GATCGCCGGC GGCGGCCAGG AGCGCGGGTT GCGCTCGGGG ACCGAGAACG TGGCCGCGAT CGTCGGCTTC GGCGTGGCTT GCGAGCGCGC GGTCGCGCGG CGCGAAGGCG AGGGCGTGCG CCTGGCCGCC CTGCGCGACG AACTCTGTGC CGCGCTCGCC GGGCGCGGCG CGCGGATCTT CTCGGCGGCT GCCCCGCGCC TGCCGAATAC CGTATTCTTC GCGGTCGAGG ACATCGACGG CGAGACCCTG GTCGGCCGCC TCGACCGTGC AGGCTTCGCG TGCGCGAGCG GCTCGGCGTG CTCGAGCGCG AACCCCGAAC CGTCCCGCAC CCTGCTGGCG ATGGGCGTGG AGCGCGGGCT GGCGCGCGCC GCGGTGCGTG TGAGCCTCGG TCGCGACACG CGCGCGGACG ACGTGCGCAG CTTCATCGAA ACCTTCGTCC GGGTGACGGA CGAACTCAAG AACCTGGCCT CGATCGGGGC CTGA
|
Protein sequence | MNFAPIYLDW NATTPLDPAV REAMLPWLGA AEPARFGNAS SRHEYGRQAR AAVDEARARV AAAVGAHATE VIFTSGGSEA NNLFLKGAAP NREPGVVAVS AIEHPCVREP ARQLRRAGWT LREIAVDAQG VIDPADWSAV LEARPGLVSV MLANNETGVL QDVAALARAA RAAGAWFHTD AVQALGKVAV DFRALGVHAM TLSAHKIGGP LGAGALVLDK RVELAPLIAG GGQERGLRSG TENVAAIVGF GVACERAVAR REGEGVRLAA LRDELCAALA GRGARIFSAA APRLPNTVFF AVEDIDGETL VGRLDRAGFA CASGSACSSA NPEPSRTLLA MGVERGLARA AVRVSLGRDT RADDVRSFIE TFVRVTDELK NLASIGA
|
| |