Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1185 |
Symbol | |
ID | 4810137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1413880 |
End bp | 1415160 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106607 |
Product | hypothetical protein |
Protein accession | YP_001037610 |
Protein GI | 125973700 |
COG category | [R] General function prediction only |
COG ID | [COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000105816 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATC CTAAAAAAGT GGAATGGAGC AGGCTGGACA ATGCTTCAAA ATATTTTGCG GCAACATACA GTGAGAGGGA TGAAAAGGTA TTCAGAATCT CATGTGAGTT GTTTGAGGAA GTAGATCCGG AAATTTTGCA ACAAGCTCTT GATGAGACTA TTGAGAGATT TCCGTATTAT AAATCCGTTT TAAGAAGAGG AATATTTTGG TATTACCTTG AGGACAGCGA TATCAGGCCT TTGGTTGAAA AAGAGGATAA ACCGGTCTGC GCACCGATTT ACAGAAAATA CGAAAGAAAT CTGTTGTTTA GAGTTCTTTA CTACAACAAA AGAATCAGTC TCGAAGTATT TCATGCCCTT TCGGACGGAA CAGGGGCTCT TAGGTTTATG ATGACACTGG TTTACCATTA TTTGACGATC AAACACAAAG ATGAGTTTTC CGGCAAAATA CCCGAATTAA ATTACAATGC ATCCATCGGC GAAAAAAAGG ACGACAGTTT CGAACGGTAT TATCAAGGCA GGCGTTTTAA AAAGCAGGCA AGGGAAAAGA AAGAAAAAAA GCCGTTTAAG AGAGTATATC GCATACGGGG AACCAGAATT GAGGAAAACA GAATTAAGAT AATAGAAGGC ACAATGTCTG CAAAAGCCGT ATTAAATGAA GCACATAAAT ATAACACGAC AATGACCGTG TTTTTATCGG CGCTGTTGCT TCGCTCAATT TACATGGATA TGCCGGCCCG AAAAAAAGAC TATTCTTTGG TGTTAATAGT ACCTATTAAC CTCAGACAGT TTTTCAAATC GGAAACGGCA AGCAATTTTT TCAGTACGAT GAGCATTGAG TATAAGTTTA CCGAAGAAGG CATGGAGCTT GATAAAATAA TCGCAAGTCT GAATGAGAGT TTTAAAAAAG AACTTACGGA AGAAAGGCTG AGCGAGAAGA TTAACTGGCA AATGTCCATT GAAAAAAATC CTTTTGCCAG AATTATGCCC CTGCCGCTTA AAAATCTCTT TATTCGTATT GCTGATGAAG TGGTGGAAAG CAGAACCACC GCATGCATAT CCAACTTGGG CAAAATACAA ATGCCTCCTG AGTTTGAAAG GTATATCCGA CAGTTCAGTG TCGTACCCAA TGTCAGAAGA CCTCAGATTG CGGTATGTAC ATACGGGGAC AAAATGGCGG TAGCTTTTGG TTCGCCGTTC AAAGAAACCG AGATACAAAA AAATTTTTTC AAATCCCTGT CGGAAATGGG GATTAAAATT GAAATAGTGT CAAACATGTA G
|
Protein sequence | MNYPKKVEWS RLDNASKYFA ATYSERDEKV FRISCELFEE VDPEILQQAL DETIERFPYY KSVLRRGIFW YYLEDSDIRP LVEKEDKPVC APIYRKYERN LLFRVLYYNK RISLEVFHAL SDGTGALRFM MTLVYHYLTI KHKDEFSGKI PELNYNASIG EKKDDSFERY YQGRRFKKQA REKKEKKPFK RVYRIRGTRI EENRIKIIEG TMSAKAVLNE AHKYNTTMTV FLSALLLRSI YMDMPARKKD YSLVLIVPIN LRQFFKSETA SNFFSTMSIE YKFTEEGMEL DKIIASLNES FKKELTEERL SEKINWQMSI EKNPFARIMP LPLKNLFIRI ADEVVESRTT ACISNLGKIQ MPPEFERYIR QFSVVPNVRR PQIAVCTYGD KMAVAFGSPF KETEIQKNFF KSLSEMGIKI EIVSNM
|
| |