Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0372 |
Symbol | |
ID | 4808449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 467460 |
End bp | 468854 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640105786 |
Product | sulfide dehydrogenase (flavoprotein) subunit SudA |
Protein accession | YP_001036803 |
Protein GI | 125972893 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | [TIGR01316] glutamate synthase (NADPH), homotetrameric |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.63559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAACA TGTCACCGAA AAAAGTTCCC ATGCCGGAGC AGGACCCAAA CGTCAGAATC AAAAACTTTT TGGAGGTTGC TTTAGGATAT ACCGAGCAAA TGGCAATGGA AGAAGCTCAA AGGTGTCTTA ACTGCAAGCA CAAACCTTGT GTTTCCGGCT GTCCCGTAAA CGTAAAAATT CCTGAGTTTG TACAGCTTAT CGCTCAGGGA AAATTTGAGG AAGCCTACAA TAAAATAAGA GAAACCAACA ACCTTCCGGC AATATGCGGC AGAGTCTGTC CGCAGGAAAA CCAGTGTGAA AAGTTCTGTG TAAGGGGTAT AAAAGGTGAA CCTGTTGCCA TAGGAAGGCT TGAAAGATTT GCGGCGGACT GGCACATGAA AAACGGCACC ACTTCTTATG AAAAGCCTGA AAAAAACGGC AAAAGGGTGG CAGTAATAGG TTCGGGACCT GCAAGCCTTA CCTGTGCAAG CGACCTGGCC AAACTCGGCT ACGAAGTAAC AATCTTCGAA GCCTTTCACG TGCCCGGCGG AGTGTTGATG TACGGTATTC CGGAATTCAG GCTTCCAAAG AAACTGGTTC AGGAGGAAAT TGAAACCATA AAGCAGCTGG GTGTGGAAAT TAAAACAAAT ATGGTTATAG GAAAGGTTTA TTCCATTGAC GAACTCAAAG CTGAAGGATA TGATGCCATA TTTATAGGCT CGGGTGCCGG ATTGCCTTCA TTTATGAAAA TTCCCGGAGA AAACCTCAAC GGAGTTTACT CGGCAAATGA GTTTCTCACA AGAATAAACC TCATGAAGGC TTATGAATTC CCCAACTGCG ATACTCCCGT GAAAGTAGGA AAGAATGTCG CCGTTGTGGG CGGAGGAAAT GTCGCAATGG ACGCCGCAAG AAGCGCAAAA AGACTTGGCG CGGAAAACGT TTATATAGTA TACAGGCGTT CGGAAGCGGA AATGCCCGCA AGACTTGAAG AAATTCATCA CGCAAAGGAA GAAGGAATTT TGTTCAAATT CCTTACAAAC CCCACAAGAA TTCTTGGCAC CGACGACGGC TGGGTCAAAG GCATGGAGTG CATAGAGATG GAGCTGGGCG AACCTGATGA ATCCGGAAGA AGAAGACCCG TGCCAAAGCC GGGATCCGAA CATGTAATTG ATGTTGAAAC GGTTATTATC GCCATCGGCC AAACTCCAAA TCCGTTAATT GCCTCAACAA CCCCGGGGCT GGCCACTCAA AAATGGGGCG GAATTATTGT CGATGAAAAC ACCGGCGCCA CCAACATAGA AGGTGTATAT GCCGGCGGAG ATGCGGTAAC CGGTGCCGCA ACCGTCATTC TTGCAATGGG AGCAGGCAAA AAAGCCGCAA AGGCAATTGA CGAATATCTT AAAAACAAAA AATAG
|
Protein sequence | MPNMSPKKVP MPEQDPNVRI KNFLEVALGY TEQMAMEEAQ RCLNCKHKPC VSGCPVNVKI PEFVQLIAQG KFEEAYNKIR ETNNLPAICG RVCPQENQCE KFCVRGIKGE PVAIGRLERF AADWHMKNGT TSYEKPEKNG KRVAVIGSGP ASLTCASDLA KLGYEVTIFE AFHVPGGVLM YGIPEFRLPK KLVQEEIETI KQLGVEIKTN MVIGKVYSID ELKAEGYDAI FIGSGAGLPS FMKIPGENLN GVYSANEFLT RINLMKAYEF PNCDTPVKVG KNVAVVGGGN VAMDAARSAK RLGAENVYIV YRRSEAEMPA RLEEIHHAKE EGILFKFLTN PTRILGTDDG WVKGMECIEM ELGEPDESGR RRPVPKPGSE HVIDVETVII AIGQTPNPLI ASTTPGLATQ KWGGIIVDEN TGATNIEGVY AGGDAVTGAA TVILAMGAGK KAAKAIDEYL KNKK
|
| |