Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1971 |
Symbol | |
ID | 3831153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2054267 |
End bp | 2055193 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829902 |
Product | cysteine synthase |
Protein accession | YP_430812 |
Protein GI | 83590803 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000186941 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTCGTA TTTACCAGAA CATCACCCAG CTTATCGGTG GTACCCCGAT CGTCAAGCTC CACCGTATGA ATCCAAGCGA AGCAGAAGTT CTGGTCAAGC TGGAACTTTT TAACCCCGGC GGAAGTATCA AAGATCGTAT CGCCTTAGCC ATGATTGAAG ACGCCGAAGA GCGGGGGCTC CTGGACAAAG ACACCGTTAT TATTGAACCG ACCAGTGGCA ATACCGGCAT CGGCCTGGCC ATGGTGGCCG CGGCACGGGG ATACAAATTG ATCCTAACCA TGCCGGAAAC CATGAGCCAG GAACGGCGTC AGCTTCTCAA GGGCTTCGGT GCCGAGCTGG TCCTCACGCC GGGGGCCGAG GGGATGAAGG GGGCCATCCG CCGGGCCGAA GAGCTGGCTG CCACCTATCC CCGGGCCTTC ATCCCCCAGC AGTTTGAAAA TCCCGCCAAC CCGGCCGCCC ATCGCCATAC TACCGCTGCT GAGATCTATG ACGCTACCGA CGGGCAGCTG GATATCCTGG TCTGTGGTGT CGGTACCGGC GGGACCATCA CCGGCACCGG CGAAGTTTTA AAAGAACGGA TACCCGGACT GCAGGTAGTA GCTGTCGAAC CGGCTTCATC CCCTGTCCTT TCCGGCGGTA TGGCCGGACC CCATAAAATC CAGGGTATAG GCGCGGGGTT TGTTCCGGAA ATTCTCAATA TCGACGTTAT CGACGAGATT ATCCAGGTTA CGGACGAGGA AGCCATTGAT ACTGCTCGAC GGCTGGCCAG GGAAGAGGGT ATTATGGTCG GCATATCTTC CGGGGCCGCC TGCTGGGCAG CCCTGCGCCT GGCCGCCCGC CCGGAAAACG CCGGCAAGCG TATCCTGGCA GTGCTACCCG ACACCGGTGA ACGCTACCTG TCTACCTCGC TTTTCCAGGA GATTTAA
|
Protein sequence | MGRIYQNITQ LIGGTPIVKL HRMNPSEAEV LVKLELFNPG GSIKDRIALA MIEDAEERGL LDKDTVIIEP TSGNTGIGLA MVAAARGYKL ILTMPETMSQ ERRQLLKGFG AELVLTPGAE GMKGAIRRAE ELAATYPRAF IPQQFENPAN PAAHRHTTAA EIYDATDGQL DILVCGVGTG GTITGTGEVL KERIPGLQVV AVEPASSPVL SGGMAGPHKI QGIGAGFVPE ILNIDVIDEI IQVTDEEAID TARRLAREEG IMVGISSGAA CWAALRLAAR PENAGKRILA VLPDTGERYL STSLFQEI
|
| |