Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1403 |
Symbol | |
ID | 3831690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1450441 |
End bp | 1451592 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829339 |
Product | cysteine desulphurase-like protein |
Protein accession | YP_430259 |
Protein GI | 83590250 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01977] cysteine desulfurase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTATC TCAATAATGC CGCCACCTCC TGGCCCAAGC CGGAGACTGT TTACCAGGCC CACGACGCTT ATTTGCGCCA TCTCAGCGGC AGCGTCAACC GCGGCGCCGG CGGGGCCTCC CTGGACGCCG GCCGGGCGGT CCTGGAGACC AGGGAACTCC TGGCGGATTT CTTTAACATC CGCGAACCGG AGCAGGTCAT CTTCACAGCC AACGCCACCG AAGCCCTCAA CCTGGCCCTT CAGGGGTTGC TGGAGCCGGG AGACCACGTG GTTATCAGCA GCCTGGAGCA TAATGCCGTG GCGCGGCCCC TGCATACCCT GCAGGATAAG GGGGTCGAAT ATACCATCGT CAACTGCGAC GCCCGGGGCC GCCTCAATCC TTTAGATGTG GAAAGGGCCA TCGGCCCCCG GACCAGGTTG ATTTGCCTGA CCCACGCTTC CAACGTAACC GGCACCATCC TACCTGTCAA CGAAGTGGGG GAGATCGCCC GGCGGCATCA TCTCCAGTAC CTGGTGGATA CGGCCCAGAC GGCGGGGGAA ATCCCCGTGG ATGTGGAGGC GGCCGGGATT ACCCTGCTGG CCTTCACCGG CCACAAGGGC CTGCTGGGCC CACCTGGGAC CGGCGGGCTT TATATCCGTT ACCCCGATAC CGTCCGGCCC CTGATTTATG GCGGGACAGG CAGCAGGTCC GAACTCCTCA CCCAGCCAGA GGTGCTGCCT GATAAGTATG AGAGCGGCAC CGTCAATGCC CCGGCCATAG CCGCCCTGGG GGCGGGGGTA AGGTTTATCC GGGAGACGGG CTTGGACAAT ATCCGCCGCC ATACCAGGGA ATTGACAGCC CGGCTCCTGG AGGGGTTGCG CCGCCTGCCG GGGGTAACCC TTTATGGCCC CACGGATTCG GGGGAGCGGG TACCGGTGGT TTCATTAAAT ATCCGCGGCC TCTCTCCCGG GGAGGCCAGC GCCTGGCTGG CGGACCACTA TGATATTGTC AGCCGCCCGG GGCTCCATTG CGCCCCCCTG GCCCACCAAA CCATCGGCAC CCTTAAGACG GGCACCCTGC GCCTGAGCCC TGGCTTTTTT AATACCGCTG CCGAAATCGA TGCCGCCCTG GCAGCAATCA AAGAACTGAT GGAGGTCATG CAGGGTGAAT GA
|
Protein sequence | MIYLNNAATS WPKPETVYQA HDAYLRHLSG SVNRGAGGAS LDAGRAVLET RELLADFFNI REPEQVIFTA NATEALNLAL QGLLEPGDHV VISSLEHNAV ARPLHTLQDK GVEYTIVNCD ARGRLNPLDV ERAIGPRTRL ICLTHASNVT GTILPVNEVG EIARRHHLQY LVDTAQTAGE IPVDVEAAGI TLLAFTGHKG LLGPPGTGGL YIRYPDTVRP LIYGGTGSRS ELLTQPEVLP DKYESGTVNA PAIAALGAGV RFIRETGLDN IRRHTRELTA RLLEGLRRLP GVTLYGPTDS GERVPVVSLN IRGLSPGEAS AWLADHYDIV SRPGLHCAPL AHQTIGTLKT GTLRLSPGFF NTAAEIDAAL AAIKELMEVM QGE
|
| |