Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2423 |
Symbol | |
ID | 4808139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2893768 |
End bp | 2896764 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107837 |
Product | hypothetical protein |
Protein accession | YP_001038818 |
Protein GI | 125974908 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AGTTTTATCT TCGAACGTTT GCGCTTCTGA TAGTTCTCTG CGTTTTTGTC AACCTTTCAT TAATAGCAGG TTTTAATAGG CTTTTTGTAG AAAATGTTTA TGCAGATGAA GGTGTTGAAT TTCCGGTTTC AAGCAACAAG GTTTATGTAA CATTGAAAAA CATCAAAACA GGTGTGCCAT CAGATACTAT AGCCTTGAAG ATTGGTATTA TCAATTTAAA TAAAGCAATA AACATAAATT TGAATGATAT TAAACTAAGA TATTACTTTA CTAATGACGG CTGCTCCCCT ATACAGGTTA ATATAAAATT ATTTGGCACA GAAACGGAGA GTTTCAACCC TGAACTGGTT AAAACTTCAG TAGTGACAGG TTTGTCCTAT CCGGGTGCTG ACAGCTATGT TGAAATAGGA TTTACCGGTT CTGTAGAGTT AAATTGTGAT CGCAAACCTA TATACATTGA ACTTGATATC AAAGAAAACA GCCCTGATCG TAACTTCGAT CAATCCAATG ATTTTTCCAA TAATAATTAC TATACTCCCT TTTTGCCTGA AGAGTTTTTT GCATCGGGAA GAGTGCCGGT TTTCATGTAC GATCCAAAAA AACGCGACTA TGTGCTTTTG ACCGGTGTAC TCCCCAGCGA AACATCCAAG ATTCCGAATC CAACTCCGAC GCCTGTAGTA TCTCCTTCTC CTTCACCAAT AGTACCCGAA GGTGAAAAAA TACTTGCTAC TGCTTCAGGA AATATCATTA TACCAAAACC AAGTCCTGCC CCTGTTGGAT TATTTGAACC GGCTGATATT GGATTCCCGC ATGAAGGTTC CATAGATGTG GGTTTCTCTC CACGCAAAAA AGAAGCCTTG ATTTTGCTTG ATTCATCTTA TGAATCCAAC GATGTAGATG ATGGACCGAC AGGTATTTTT AAATATTGTC TGTTCTCCAG TGGTGATTCT TTATACCAGG GAGACAATAT AACAATTGAA GGGGATGTGT TTACAAGAAA TACAATGAAT GTTACGACTT CAGGTATTAA AATAACAGGA AAAGTTGAAT ACTCATTTAG AGATCGAAAC TCATATGCTG GTCCTTTGGG TGGTAAAGAA ATAGAGATGG AACCGGCTGA TGCAGAGAGG TATGATCGTT ATCTTGTACC TGATGAAAAT GACGACTATT CAGCATTATT TTCAATGATA CAGACTAAGG TTACAAATCT TCCTGAAAAA GATGACGCTA AGTTTTTAAT TACAGAAGAC ACAGTCGAAA AATACATTTC TCCCGCAAAT CCTAATTGGC GGAAAAATGC AGTACAGTTT TACTATGATG ACAATGACAA ATCCGTATCT ATTGAATACC GTTCAAAAGA AGCCGGAGGA TTTAGTCGCG AATACGGTAA CTCCTTACCG CAATATTCTA TAAAACAAGA CAAAGGCGAT GCATTTGTAC TTAAATCCAA TATGTTTTTT GACGGCAATC TTATAATAAG CGTCAAAGGT ATTAGGCAAG AATTAATTGA TGGTGCCACA TCTGCTTTTA TATACGCTTA TGGCGATATA ATTCTACAAG GAAATGGAGC AACGTTTAAT GATGTGTATC TTATAACAAA ATATGGAAAC ATTTATATTG AAACAGATAG CTGTAATGTC AACGGCATTG CATTTGCTCC AAACGGAAAA ATTGTCATTA ACGGTCGAAG CAACAATCTA CAAGGTAGTT TCGTTGCAAG AAAAATCCAA TGTGAGCCAG GTAATAGTGT TTTTAAAGGA CCCACTGATG ACCAATTAGA AGATATTGAA GATGCCCTGA AAAGTACAGA AGGTTTTGAC ACCATTAGAA ACTCAATAGC CTTACTTCCC TATATATTTG ATGAATATAC AAGAGCTGGT ATCATAACCT ATTCGGATTA TGCAAACATA AATGACTCAC CGATTAACGA TAGTTGGAAA TTTTTTGATG CTGCTACTGA GCGGGAAGAA TTTTTAAATT ATACTTTGAC ACTGTCAGTG GACGAGGACA GCAAAAGAAG CAATCTTGGT GACGGACTGA GAAAAGCTTT GGATGTATTC AATAAATACT CAGATCCCGA AGCGGACAAA TATATATACA TCTTTACAAG TCTTGATCCA AATGCATACA CCAGATCGCA TCTTTCTGAC GGACTATTTG AAACTGACCC GAAAGTTGAT ACTAATGCGG CCTATATTTA TGATGAAACT GTCAACGGGG AAGGAAACCA ATATGTTAGA GAAATAATGA AATTAATTGA AAAGTATAAT AATAATCACG TCAATGGCAA AATAAAACTT ATACTTGTTG ATTTGACTAA TTATATTAAA GAATTCAATA TAAAGAACGG TGCAAAAGAA TCTGAAATAG AAGTTGACGT TTTAACAAAT CTTGCCTATG ACCTGGGAAT TGACATTTCT GACTCTGATG AAAAAGCTTA CTACTGTCCT TCATTAGAGG ATATACAATC ACTTTCAATT ATAAACGAAT TGGCATATCG TTCAAACAGT ATGCCGCCTA AACTTGCTGT GGAAAACTTG AAAATCAGTT CAGCACAATT TGAACTGTCA CTGCCAAGTT ACATCAAACC GGTTGAATTG TTCTTCAAGA GGGCAAGTAA TACAAAAGAG TCAATTGTTA ACTTGTCAGG ACTTGCAGCA TCTGGTGGCA AATACAATAT AACCTATACT TTCAGCGGCG ATGAGCTGGC AACCCTTACA AGGATTAGCG ACGGCTTGAA ATATGACTTG GAAAGCAACG GATTATACAT GACCCTGATT GTCAACAGTA GTGACGATTG GGACGATGGA GATAATCCTC TGACCGTTAA AGGTACCGTT GACATTGCCG GACCTAAAAT AACATATAAA TTGTTTGATG ACAAAAATAA TGACGGCGTA AGGTCCGCAG GTGAAGCTGA ATTTGAAGTT GTAGTGCCGT TCGACAATAT TAAATTCAAT GTAGAGTACA AGAAGGATAT CAACTAA
|
Protein sequence | MKRKFYLRTF ALLIVLCVFV NLSLIAGFNR LFVENVYADE GVEFPVSSNK VYVTLKNIKT GVPSDTIALK IGIINLNKAI NINLNDIKLR YYFTNDGCSP IQVNIKLFGT ETESFNPELV KTSVVTGLSY PGADSYVEIG FTGSVELNCD RKPIYIELDI KENSPDRNFD QSNDFSNNNY YTPFLPEEFF ASGRVPVFMY DPKKRDYVLL TGVLPSETSK IPNPTPTPVV SPSPSPIVPE GEKILATASG NIIIPKPSPA PVGLFEPADI GFPHEGSIDV GFSPRKKEAL ILLDSSYESN DVDDGPTGIF KYCLFSSGDS LYQGDNITIE GDVFTRNTMN VTTSGIKITG KVEYSFRDRN SYAGPLGGKE IEMEPADAER YDRYLVPDEN DDYSALFSMI QTKVTNLPEK DDAKFLITED TVEKYISPAN PNWRKNAVQF YYDDNDKSVS IEYRSKEAGG FSREYGNSLP QYSIKQDKGD AFVLKSNMFF DGNLIISVKG IRQELIDGAT SAFIYAYGDI ILQGNGATFN DVYLITKYGN IYIETDSCNV NGIAFAPNGK IVINGRSNNL QGSFVARKIQ CEPGNSVFKG PTDDQLEDIE DALKSTEGFD TIRNSIALLP YIFDEYTRAG IITYSDYANI NDSPINDSWK FFDAATEREE FLNYTLTLSV DEDSKRSNLG DGLRKALDVF NKYSDPEADK YIYIFTSLDP NAYTRSHLSD GLFETDPKVD TNAAYIYDET VNGEGNQYVR EIMKLIEKYN NNHVNGKIKL ILVDLTNYIK EFNIKNGAKE SEIEVDVLTN LAYDLGIDIS DSDEKAYYCP SLEDIQSLSI INELAYRSNS MPPKLAVENL KISSAQFELS LPSYIKPVEL FFKRASNTKE SIVNLSGLAA SGGKYNITYT FSGDELATLT RISDGLKYDL ESNGLYMTLI VNSSDDWDDG DNPLTVKGTV DIAGPKITYK LFDDKNNDGV RSAGEAEFEV VVPFDNIKFN VEYKKDIN
|
| |