Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0613 |
Symbol | |
ID | 4808215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 752381 |
End bp | 754186 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106027 |
Product | thiamine pyrophosphate enzyme-like TPP-binding |
Protein accession | YP_001037041 |
Protein GI | 125973131 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0102763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAAT TATGTGATAA GATAGAAATA ATATTATTCC ATTATTTATG GTTAGATGGG GGATTGTTGA TGAAAAAACT AATGCTCGGA AACGAGGCGG TAGCCAGAGG AGCTTATGAA GCAGGAGTCA GAGTGGCAAC GGCTTACCCT GGAACGCCAA GTACCGAAAT AACGGAAAGT ATAGCAAAAT ACGACGAAGT GTATTGTGAA TGGTCACCGA ATGAAAAAGT TGCTCTGGAA GTAGCTATAG GCTCGGCAAT TGCCGGTGGA AGAGCAATTT GTTCGATGAA GCATGTCGGA CTTAATGTGG CGGCCGATCC TTTGTTCACC GTATCCTACA CCGGAGTTAA TGCAGGACTG GTTATCATGG TTGCGGATGA TCCGGGAATG CATAGCTCTC AAAACGAACA GGACAGCAGA ATGTATGCAA AAGCTGCAAA AGTGCCCATG GTTGAGCCTG CCGACAGTAG AGAGTGTAAA GAATATGTGA AGGCGGCTTT TGAAATAAGC GAAAAATTTG ATACTCCGGT TATTTTCAGA CTTTCGACTA GGATATCCCA TTCACAGAGT GTTGTTGAAA TCGGTGAAAG AGAGGATGTA CCGCTTAAGG AATACAAAAA GGATCCCGAA AAATATGTCA TGATGCCTGC AATGGCGCGT AAAAGACATG TTGCGGTAGA AAGCAGAATG GCTGCTTTGG CCGAGTTCTC CAATACAACT CCTCTAAACA GGATTGAATG GGGAAGCACG GATATCGGTG TAATAACCAG CGGAATTGCC TACCAGTATG CAAGAGAAGC CTTCGGTGAT GTCTCATATC TCAAGCTCGG AATGATTTAT CCACTTCCGG ACAAGTTAAT TAAAGAGTTT TCCGAAAAAG TAAAAGTATT ATATGTTATA GAGGAACTGG AACCGTTTTT TGAAGAGCAT ATAAGAAAAC TGGGAATCAA GGTTATAGGC AAGGATTTAC TGCCCGTTAC AGGGGAATAC AGTGCCAACC TTATCAGGGA AAAAGTATTT GGAAAACAAA TCGGCGGAAA GAAAATTGAG GGAGTCCAGG TTCCGGTCAG ACCTCCTGTA ATGTGTCCGG GATGCCCGCA CAGGGGAATG TTTTATGTGT TAAACAAACT GAAACTTACC GTAAGCGGAG ACATCGGATG TTACACATTG GGAGCGTTGC CGCCCACCCA GGCGATGGAC ACTTGTATCT GTATGGGTGC AAGTATCGGT GCGGCTCACG GTATGGAAAA AGCAAGGGGC AGAGAGTTTA GAACAAAAAC TGTTGCCATA CTGGGAGATT CCACATTTGT GCATTCGGGA ATAACCGGAC TTATTGACGT GGTTTACAAT AAAGGAAATT CCACGGTGAT AATACTGGAC AACTCCATTA CCGGTATGAC AGGTCATCAG CACAATCCTA CAACCGGTTT TACAATAAAG GGAGAACCCA CAAAACAGGT GGATCTTGTA AAGCTTTGCA ATGCCGTTGG AATTGACAGG GTCAAGGTTG TAGATCCTTT TAATATAAAA GAGTTTGAAA AAACCGTAAA AGAAGAAATT GAAGCTGATG AGCCTTCGGT CATAATTTCC CAAAGACCGT GTGCCCTTTT AAAACATGTA AAATTTGAAG GCCCGCACAG GATAGTCATT GAAAAGTGCA GAAAATGCAA GATGTGTATG AGAATAGGAT GTCCTGCCAT TGTTGACATG GGGGACCATA TAGAAATAAA TGATGCTTTG TGTGTGGGTT GCGGGCTGTG TTCAAAAGTA TGTAATTTCG ATGCGATTGA AAAGGCTGGT GAATAA
|
Protein sequence | MLELCDKIEI ILFHYLWLDG GLLMKKLMLG NEAVARGAYE AGVRVATAYP GTPSTEITES IAKYDEVYCE WSPNEKVALE VAIGSAIAGG RAICSMKHVG LNVAADPLFT VSYTGVNAGL VIMVADDPGM HSSQNEQDSR MYAKAAKVPM VEPADSRECK EYVKAAFEIS EKFDTPVIFR LSTRISHSQS VVEIGEREDV PLKEYKKDPE KYVMMPAMAR KRHVAVESRM AALAEFSNTT PLNRIEWGST DIGVITSGIA YQYAREAFGD VSYLKLGMIY PLPDKLIKEF SEKVKVLYVI EELEPFFEEH IRKLGIKVIG KDLLPVTGEY SANLIREKVF GKQIGGKKIE GVQVPVRPPV MCPGCPHRGM FYVLNKLKLT VSGDIGCYTL GALPPTQAMD TCICMGASIG AAHGMEKARG REFRTKTVAI LGDSTFVHSG ITGLIDVVYN KGNSTVIILD NSITGMTGHQ HNPTTGFTIK GEPTKQVDLV KLCNAVGIDR VKVVDPFNIK EFEKTVKEEI EADEPSVIIS QRPCALLKHV KFEGPHRIVI EKCRKCKMCM RIGCPAIVDM GDHIEINDAL CVGCGLCSKV CNFDAIEKAG E
|
| |