Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1932 |
Symbol | |
ID | 7310648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2282022 |
End bp | 2283203 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608866 |
Product | cysteine desulfurase NifS |
Protein accession | YP_002506260 |
Protein GI | 220929351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000108248 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGATA AAACAGTATA TTTGGATCAT GCTGCCACCA CATATGTTAA ATCTGAAGTA TTCGATGCTA TGAAACCGTA TTTTAGTGAG CATTTTGGAA ATGCTTCATC TATATATAGT TTGGGTCGTG ACAGCAAAAA AGCAGTAGAA GAGTCCAGAG AAAAGGTTGC AAATGCAATA GGTGCAGAAC CCAGAGAGAT ATACTTCACA GGCTCTGGAA GTGAAGCAGA TAACTGGGCA CTCAAGGGAA TAGCTGCAGC GTTTAAGAAA AAAGGAAACC ATATCATTAC ATCCGCTATT GAGCATCCGG CGATAATGAG TTCATGTAAA TACCTTGAAG GCGAAGGTTT TGAGATAACT TATCTTCCTG TTGACAGTGA CGGACTTGTA ACGCCTGAAC AGGTAAGAGA TGCAATAAGG GATAATACAA TACTGATAAG CATAATGTTT GCGAATAATG AGATAGGTAC TATTCAGCCT ATAAAGGAGA TTGGTGCCAT TGCAAAAGAA AAGGGAGTTT TATTCCATAC AGATGCTGTA CAGGCAGTAG GAAATATAAG GATAGACGTT AAAGAGCTAA ATGTCGACCT CCTTTCATTA TCTAGTCACA AATTTTATGG GCCAAAAGGA ATAGGAGCTT TATATATAAA GAAAGGCATC AAGATTCCCT CTTTTATCCA CGGAGGACAG CAGGAGCGTG GAAAAAGAGC GAGTACAGAA AATGTTCCGG CAATAATTGG TTTGGGAAAA GCAATTGAGA TTGCTACAGA AAACCTTGAC GAATACAACA AGAAACTCAC AGAATTCAGA GAAAAAACTA TTGAGGGACT TTTTGCAAAA GTTCCATATA TCAGGTTAAA CGGACACAGA CATAACAGAC TGCCCGGTAA CGTAAATATT TCATTTGAAT TTATAGAGGG AGAATCACTT CTCCTAATGC TTGATATGAA GGGAATTTGC GGCTCCAGCG GGTCAGCCTG TTCATCAGGT TCACTAGACC CGTCACATGT ACTTCTTGCA ATAGGTTTGC CTCATGAGAT AGCACATGGT TCGTTGAGGC TTACTTTTGG AGATGAAAAT ACTCATGAAG ATGTTGATTA TATACTTGAG GTTATACCTC AGATGGTAAG AAAACTAAGA GATATGTCGC CTCTTTGGGA AGCTGTAAAA GATAAGAAGT AA
|
Protein sequence | MEDKTVYLDH AATTYVKSEV FDAMKPYFSE HFGNASSIYS LGRDSKKAVE ESREKVANAI GAEPREIYFT GSGSEADNWA LKGIAAAFKK KGNHIITSAI EHPAIMSSCK YLEGEGFEIT YLPVDSDGLV TPEQVRDAIR DNTILISIMF ANNEIGTIQP IKEIGAIAKE KGVLFHTDAV QAVGNIRIDV KELNVDLLSL SSHKFYGPKG IGALYIKKGI KIPSFIHGGQ QERGKRASTE NVPAIIGLGK AIEIATENLD EYNKKLTEFR EKTIEGLFAK VPYIRLNGHR HNRLPGNVNI SFEFIEGESL LLMLDMKGIC GSSGSACSSG SLDPSHVLLA IGLPHEIAHG SLRLTFGDEN THEDVDYILE VIPQMVRKLR DMSPLWEAVK DKK
|
| |