Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3055 |
Symbol | |
ID | 7874525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3308066 |
End bp | 3309250 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699978 |
Product | O-succinylhomoserine sulfhydrylase |
Protein accession | YP_002890030 |
Protein GI | 237653716 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases |
TIGRFAM ID | [TIGR01325] O-succinylhomoserine sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCC CCCTTCCGCC CGAGCTTCAC CCCGACACCC TGGCCGTGCG CGCCGGCACC GAGCGCACGC AGTTCCGCGA GCACGGCGAG GCGATGTACC TCACCTCGAG CTTCGTCTTC GACAGCGCCG CGCAGGCCGC CGCCTGCTTC TCGGGCGAGG AGGAGGGCTA CGTCTACGCG CGCTTCTCCA ACCCCACCGT GACCGCGATG CAGAACCGGC TCGCCGCGCT CGAAGGCGGC GAGGCCTGCA TCGCCACCGC GTCCGGCATG TCCGCGATCC TGTCGCTGGC GATGGCGACC ATGCAGGCGG GCGACCACGT GGTGGCCTCC AACGGGCTGT TCGGCGCCAC CCAGCAGCTC TTCGGCGGCA TCCTGTCGAA GTTCGGCATC GCCACCAGCT TCGTGCCGGC CACCGAGCTC GACGCCTGGC GCGCCGCGAT CACGCCGCGC ACCAGGCTCT TCTTCACCGA GACGCCCTCC AACCCGCTCA CCGAGGTGAT CGACATTGCC GGCGTGGCGG CGATCGCGCG CGCGGCGGGG GTGATCTTCG CGGTCGACAA CTGCTTCTGC ACGCCGGCGC TGCAGCGCCC GCTGGAGCTC GGCGCCGACG TCGTCGTGCA TTCGGCCACC AAGTATCTCG ACGGCCAGGG CAGGGTGCTC GGCGGCGCGG TGGTCGGCAG CAAGGCGATC ACCGACGAAG TGTTCAAGTT CCTGCGCACC GCCGGGCCGA CCCTGTCGCC GTTCAACGCC TGGGTGATCC TCAAGGGCCT GGAGACGCTG CGCATCCGCA TGGAGGCGCA GTCGGCGAGT GCGCTCGAGC TGGCGCGCTG GCTCGAGGCC CAGCCGGGCG TGGCGCGGGT GTACTACCCC GGGCTGGAAT CCCACCCCCA GCACGCGCTG GCCATGCGCC AGCAAAAGAG CGGCGGCGCC ATCGTCAGCT TCGACGTGAA GGGCGGCCGC GAGGCGGCGT GGAAGGTGGT CGACGCCACC CGCCTGATCT CGATCACCGC CAATCTTGGC GACACCAAGA GCACCATCAC CCACCCCGCG ACCACCACCC ACGGCCGCAT CAGCGCCGAG GCGCGCGCCA CCGCCGGCAT CGGCGACGGC CTGCTGCGCA TCGCCGTCGG TCTGGAAGAC GTCGACGACC TCAAGGCCGA CCTCGCCCGC GGCCTGGCGG GCTGA
|
Protein sequence | MNRPLPPELH PDTLAVRAGT ERTQFREHGE AMYLTSSFVF DSAAQAAACF SGEEEGYVYA RFSNPTVTAM QNRLAALEGG EACIATASGM SAILSLAMAT MQAGDHVVAS NGLFGATQQL FGGILSKFGI ATSFVPATEL DAWRAAITPR TRLFFTETPS NPLTEVIDIA GVAAIARAAG VIFAVDNCFC TPALQRPLEL GADVVVHSAT KYLDGQGRVL GGAVVGSKAI TDEVFKFLRT AGPTLSPFNA WVILKGLETL RIRMEAQSAS ALELARWLEA QPGVARVYYP GLESHPQHAL AMRQQKSGGA IVSFDVKGGR EAAWKVVDAT RLISITANLG DTKSTITHPA TTTHGRISAE ARATAGIGDG LLRIAVGLED VDDLKADLAR GLAG
|
| |