Gene Information |
Locus tag | Cthe_2974 |
Symbol | |
ID | 4810862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3493240 |
End bp | 3494451 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108396 |
Product | hypothetical protein |
Protein accession | YP_001039364 |
Protein GI | 125975454 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
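The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 3493240
end_bp = 3494451

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1212 403
```

Under these assumptions the result agrees with the listed Gene Length (1212 bp) and Protein Length (403 aa).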
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAC AAAAAGGTAC TATTTTAAAG CTTAAAAACA ATTTGGCCAT TATCATGACC AGTGACTGCA AAATTGTTTC AATAAAGAGA CAGCCAGGCA TGTATGAGGG TTTGGAAATA TCGTTCAATA AAAACGAAAT TATAAATAAA AAGAACAAAC TGGCTTTTTA TTCCCGAATT GCCGCAGGAA TCGCCGCAAT ATTCATAATC ATGGTTATCT CTTTCAATTT ATTTAATAAT AATGATGTAT ATGCTTATGT TGCCATAGAT TCCGATGCCA GCATAGAATT TGAACTGGAT AAAAACAATA AAATAGTCAA AGTGAATTAC TATAATGATA ATACAAATAC TGTATTGGAT GAATTAGATT TAAAGAATAA ACCCGTTGAT TTTGCAATAA AAGAGGTAAT AAAAAAACTG GACTTAAATG AATCCGTTAT TTTGATATCA GCATGTTTGA AAGAACAAAA CACAAAAAAG TCCTCCGCTT CCGATAATTA TGAGTCTGAA AAATTAAGTA AATTAATTGA TATTTGTAAA AATGCCGTTG AGGTCAATGT AAGTGAAAAT GTTGAGTCAA AAGTGGTGGA AGTTTCCTAC GATTATAAAA AACTGGCTGA AAAAAACAAA CTCTCCCTAG GTCGAAGCAT TGTCTATGAA AAAGCCAAAG AGCAAGGGAT AGCTCTGAAT ATCGAAGACA TAAAAAACAA AAGCATTGGA GAGACTTTAC AGAAGGTCAA AATTGACGAT GTCGGCGTTG TACACAACGT AAAAAAAGAG GAACCAAAAA AGCCTATGCC GGAAAAGCCT GAACCTGGAA AGCCCGAACC GCAAAAACCA GAACCCGGAA AACCTGACCC GGCAAAACCC GAACCGGGAA AACCCGGACC GGAAAAGCCC GAGCCGGAAA AGCCTGAGCC GGCAAAGCCT GAGCCGGCAA AACCTGAGCC GCAACCACAA ATAAATGATT TGCCAAAAGA TAAAACCATA CCGGAAGAGA AAACAATTCC GAATTCCGGA GTTGAACCAA TGGCCGAGCC GATAGTTGAA CCAAAAGACA AACAGCAGGA AAAACCCAGG CCCGATTCAA AGCTTAAACT TGAAGAAAAA CCTACGGTTG AACCAAAAGA CTCCTTGGAA GAAAAACCCG TGACAAAACC AAAGGATGAC AAAAAGGAAA AAGCAAAGAA CAGCATTGAA AAAATGCCAT AG
Protein sequence | MTKQKGTILK LKNNLAIIMT SDCKIVSIKR QPGMYEGLEI SFNKNEIINK KNKLAFYSRI AAGIAAIFII MVISFNLFNN NDVYAYVAID SDASIEFELD KNNKIVKVNY YNDNTNTVLD ELDLKNKPVD FAIKEVIKKL DLNESVILIS ACLKEQNTKK SSASDNYESE KLSKLIDICK NAVEVNVSEN VESKVVEVSY DYKKLAEKNK LSLGRSIVYE KAKEQGIALN IEDIKNKSIG ETLQKVKIDD VGVVHNVKKE EPKKPMPEKP EPGKPEPQKP EPGKPDPAKP EPGKPGPEKP EPEKPEPAKP EPAKPEPQPQ INDLPKDKTI PEEKTIPNSG VEPMAEPIVE PKDKQQEKPR PDSKLKLEEK PTVEPKDSLE EKPVTKPKDD KKEKAKNSIE KMP
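The sequences are displayed in space-separated blocks of 10 characters. Although the gene lies on the minus strand, the gene sequence is shown in coding orientation: it begins with ATG and ends with the stop codon TAG. As a minimal sketch, the helpers below remove the block spacing, compute a GC fraction, and translate codons. All function names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino-acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence give the first ten residues.
first_blocks = clean("ATGACAAAAC AAAAAGGTAC TATTTTAAAG")
print(translate(first_blocks))  # MTKQKGTILK
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` should allow the listed GC content and protein sequence to be compared against the record.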