Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0552 |
Symbol | |
ID | 7085166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 623019 |
End bp | 623996 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697579 |
Product | cysteine synthase A |
Protein accession | YP_002354221 |
Protein GI | 217968987 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCACCT GGTACCCCGA CAATGCAGAA TCCATCGGCC GCACCCCGCT CGTGCGCCTG AACCGCGTGC TCGACGGCGC GCAGGCCACC GTGCTGGCGA AGATCGAGGG CCGCAACCCG GCCTACTCGG TCAAGTGCCG CATCGGCGCG GCGATGGTCA AGGACGCGAT CGCCCACGGC CGCCTCGGCC CCGGCCAGGA GATCGTCGAG CCCACCAGCG GCAACACCGG CATCGCGCTC GCCTTCGTGT GCGCCGCGCG CGGCATCCCG CTGACCCTGA CCATGCCCGA GACCATGAGC GTGGAGCGCC GCAAGCTGCT CGTCGCCTAT GGCGCCAGAC TGGTGCTGAC CGAAGGCCCC AAGGGCATGA ACGGCGCCAT CGCCAAGGCC AAGGAGATCG TCGACAGCGA CCCCGGCCGC TACGTGCTGC TGCAGCAGTT CGAGAACCCC GCCAACCCGG CGATCCACGA GACCACCACC GGCCCGGAGA TCTGGAACGA CACCGACGGC GGCATCGACA TCCTGGTGTC GGGCGTGGGC ACCGGCGGCA CCATCACCGG CATCTCGCGC TACATCAAGC GCGTGCGCGG CAAGGACATC CGCTCGATCG CGGTCGAGCC CGCCGCCAGC CCGGTGATCA GCCAGACCCT CGCCGGCCAG CCGCTGACCC CTGCGCCGCA CAAGATCCAG GGCCTGGGCG CGGGCTTCGT GCCGAAAGTG CTCGACCTCT CGCTGATCGA CGCGGTCGAG CAGGTGAGCA ACGAAGACGC GGTGCTCTAC GCCCGCCGCC TGGCGCGCGA GGAGGGCATC CTCGCCGGCA TCTCCTGCGG CGCGGCGGTG GCCGCCGCGG CGCGCGTCGC GAAGCAGCCC GAGAACGCCG GCAAGACCAT CGTGGTGATC CTGCCCGACT CGGGCGAGCG CTACCTGAGC TCGATCCTAT TCGAGGGCCT GTTCGACGAG AAGGGCATGG CGATCTGA
|
Protein sequence | MSTWYPDNAE SIGRTPLVRL NRVLDGAQAT VLAKIEGRNP AYSVKCRIGA AMVKDAIAHG RLGPGQEIVE PTSGNTGIAL AFVCAARGIP LTLTMPETMS VERRKLLVAY GARLVLTEGP KGMNGAIAKA KEIVDSDPGR YVLLQQFENP ANPAIHETTT GPEIWNDTDG GIDILVSGVG TGGTITGISR YIKRVRGKDI RSIAVEPAAS PVISQTLAGQ PLTPAPHKIQ GLGAGFVPKV LDLSLIDAVE QVSNEDAVLY ARRLAREEGI LAGISCGAAV AAAARVAKQP ENAGKTIVVI LPDSGERYLS SILFEGLFDE KGMAI
|
| |