Gene Information |
Locus tag | Cphy_1684 |
Symbol | |
ID | 5741515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2066377 |
End bp | 2067711 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641292784 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001558795 |
Protein GI | 160879827 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
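The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed 1335 bp) and that the final codon of the CDS is a stop codon that contributes no residue. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record for Cphy_1684.
length = gene_length_bp(2066377, 2067711)
print(length, protein_length_aa(length))  # prints: 1335 444
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.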
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTG AACTTATGCA AACGTATTGT TCATGGAGTG AATATGAAAA TGCAATTATT GATGTCTTAA GTGTTTATAA GTCTGAAGGT CATGAATCAC CAATACATGC ATCTACATTT GATGTGGACG CGCTGCGAAA CGATTTTCCG ATCCTCTCAA CAAAGGTTCA TGGAAAACCC TTGATTTGGC TTGATAATGC CGCAACAACC CAAAAACCTA AGCCTGTTAT AGATCGCCTT TCCTATTTCT ATGAACACGA AAATTCAAAT ATCCACAGAG GCGCACACAC CCTTGCTGCT ACGGCTACAG ATGCTTATGA AGCGGCGAGA AACAAAATTA AACATTTTAT AAACGCTTCT TCTTACGAAG AAATTATTTT TGTACGAGGT GCTACGGAAG GTATTAATCT GGTAGCAGCA TCTTATGGGA GACATAACCT AAATAAGGAT GACGAGATTC TCGTTTCCTG TCTCGAACAT CATGCTAATA TTGTACCCTG GCAAATGCTT TGCGCGGAAA CAGGCGCTGT ACTACGTATT ATACCTGTCG ATGATACCGG ACAGATAGAT ATGCAGTCAT ATAAGAAGCT GTTGTCACCC AAAGTAAAAA TTGTGGCCGC TTCCTATGTA TCCAATTCTC TTGGCACAAT AACGCCCATT AAGGACATTG TTGCTATGGC ACATCAATAT GGAGCAAAAG TTTTGGTCGA CGCTGCTCAG GCAGTACCTC ATTTTAAAGT GGATGTTAAT GATTTAGACT GTGATTTTCT TGTCTTTTCA GGTCATAAGC TATTTGGACC CACAGGCATC GGCATATTAT ATGGTAAAAA GGATACGTTA AATGAAATGT CGCCCTATCA AGGCGGCGGA AATATGATTG ACAATGTAAC ATTCGAAAAA ACTACTTATC AACTGACACC CCAAAGATTT GAAGCGGGTA CAGGCGATAT AGCAGGCGCT GTGGGACTAG GTGCTGCCGT GGATTATCTG CAGCATATAG GAATGGATCA TATTGCTAAA TATGAGCATA GTCTTTTGTC GTATACTTCT GATGCCTTAC GTAATATTCC CGGACTGTCT GTCATAGGAA CAGCCTCAGA AAAAGCTGGA GTCATCTCTT TTACACTAAA CGGTATTGCC ACAGAATCCA TCGGACAGAT GTTGGATAAA GAAGGGATTG CAGTCCGTAC AGGCCATCAT TGTTCTTTAC CAATTTTAAG AAGATTTGGC GTAGAAAGCA CAGCCAGAAT TTCATTAGCT TTTTATAATA CCTACAGAGA GATCGATGTA CTGATAGATA CTCTATGGAA ACTAATAACA AAGACGTATA CTTAA
|
Protein sequence | MSIELMQTYC SWSEYENAII DVLSVYKSEG HESPIHASTF DVDALRNDFP ILSTKVHGKP LIWLDNAATT QKPKPVIDRL SYFYEHENSN IHRGAHTLAA TATDAYEAAR NKIKHFINAS SYEEIIFVRG ATEGINLVAA SYGRHNLNKD DEILVSCLEH HANIVPWQML CAETGAVLRI IPVDDTGQID MQSYKKLLSP KVKIVAASYV SNSLGTITPI KDIVAMAHQY GAKVLVDAAQ AVPHFKVDVN DLDCDFLVFS GHKLFGPTGI GILYGKKDTL NEMSPYQGGG NMIDNVTFEK TTYQLTPQRF EAGTGDIAGA VGLGAAVDYL QHIGMDHIAK YEHSLLSYTS DALRNIPGLS VIGTASEKAG VISFTLNGIA TESIGQMLDK EGIAVRTGHH CSLPILRRFG VESTARISLA FYNTYREIDV LIDTLWKLIT KTYT
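The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, so no reverse complement is applied even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical; only the first 30 bases of the record are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAGTATTG AACTTATGCA AACGTATTGT"))  # prints: MSIELMQTYC
```

Passing the full gene sequence to `translate` and `gc_fraction` would be one way to compare against the listed protein sequence and GC content.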