Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2214 |
Symbol | |
ID | 3830821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2311666 |
End bp | 2312817 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637830136 |
Product | cysteine desulphurase-like protein |
Protein accession | YP_431046 |
Protein GI | 83591037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0350826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000530193 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCCATTT ACCTGGATAA CGCCGCCACT ACCTTTCCCA AGCCGCCGGC GGTCTGGCAG GCCATGGAGC ATTTTATGAA GAATATAGGA GCCAGCGCCG GCAGGGGCGG CTACCGACGC GCCCTGGCGG CGGAAGAGAT AGTCTTCCAG TGCCGGCGGC TACTGGGGAA GTTATTTAAC ATTAATGACG CCACCCGCAT CGTTTTTACC GCCAATGCCA CTGAGGCCAT CAACCTGGCC CTCAAGGGCT GGTTGAACCC CGGCGATCAT GTCATTACTA CGGCTATGGA ACATAATGCC GTCTGGCGCT GTTTGAAAAC CCTGGAGAAG GAGCGCGGGA TAAGCATTAC CGTGGTGCCC TGCCGGGAAG ACGGCGAACT GCTACTGCCA GAGCTGGATG CGGCCTTCCG TAAGGAAACG CGCCTGCTGG CCTGCACCCA TGCCTCCAAT GTTACGGGAA CGGTGATGCC GGTGCAGGCA ATTACTGCCG CAGCCCACCG GCACGAGGTG CCTGTCCTGC TGGACACCGC CCAGACAGCC GGAGTATATC CCATAGATGT CCAGGAACTG GATATCGATT TCCTGGCCTT TACCGGTCAC AAAGGCCTGC TCGGACCCAT GGGTACCGGC GGCCTCTATA TACGTCCCGG TTTCGATCTG CGTCCCCTGA AGGAAGGGGG TACCGGTTCG GTGTCCCGCC TGGAGTACAT GCCTGAGGGC CTGCCGGATC GCTTCGAGGC CGGGACGCTG AATGTTGTCG GCATTGCCGG CCTTAAAGCC GCTGTGGAAT ATGTCCTCAA CCAGGGGATC AGCCGGATCC GTTCCCACGA GGAGGTTCTG ACGGCCAGGA TGCTGGCCGG GCTGGAAGAG TTACCTGCGG TGACCGTCTA TGGTCCGAGG GAAACGGCAC CCAAGGTGGG CGTGGTTTCC TTTAACATTA AAGAACTGGC CCCGGAAGAG GTAGCCTACG CCCTGGACGA GGGCCACGAA ATTATGGTCC GGGTGGGCCT GCACTGCGCG CCTCTGGCCC ATAAAACCAT AAGCACTCTG GAGCGGGGAA CGGTGCGGGC CAGCGTAAGT TATTTTAATA CCACGGAAGA GATAGACGCC TTTTTAACGG CGGTAAGGGA AATTGTAGAA TTAACAAGCT GA
|
Protein sequence | MAIYLDNAAT TFPKPPAVWQ AMEHFMKNIG ASAGRGGYRR ALAAEEIVFQ CRRLLGKLFN INDATRIVFT ANATEAINLA LKGWLNPGDH VITTAMEHNA VWRCLKTLEK ERGISITVVP CREDGELLLP ELDAAFRKET RLLACTHASN VTGTVMPVQA ITAAAHRHEV PVLLDTAQTA GVYPIDVQEL DIDFLAFTGH KGLLGPMGTG GLYIRPGFDL RPLKEGGTGS VSRLEYMPEG LPDRFEAGTL NVVGIAGLKA AVEYVLNQGI SRIRSHEEVL TARMLAGLEE LPAVTVYGPR ETAPKVGVVS FNIKELAPEE VAYALDEGHE IMVRVGLHCA PLAHKTISTL ERGTVRASVS YFNTTEEIDA FLTAVREIVE LTS
|
| |