Gene Information |
Locus tag | Cthe_1631 |
Symbol | |
ID | 4809326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1958009 |
End bp | 1959793 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640107047 |
Product | hypothetical protein |
Protein accession | YP_001038048 |
Protein GI | 125974138 |
COG category | |
COG ID | |
TIGRFAM ID | |
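The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; the record states neither, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
START_BP = 1958009
END_BP = 1959793


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1785 594
```

Under these assumptions the computed values agree with the Gene Length (1785 bp) and Protein Length (594 aa) fields listed above.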
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000289868 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAATTTTT TGACAAAACG TGTTATAATT CCTCCAAATA AGATTATTAT CGGAGGAGTA ACTTTGGATC GTATAACTCA GGCTTTTTTA GATGAATTTT CTATTAGCCA TAATTTTACT ATGTACGAAA CAAGTGTTCA ATTCGAGCAT TTTGCCAATT TCTGTGCTTT ATCTGCAGAA ACTGGCATGG TTGAAATAGA TATTCAAGAC ATGCATACTG GCAATGCTAC TCAAGGTATT GATGGAATAG CGATAGAAGT AAACGGAGCT ATTGTGTGCA GTATTGATGA GATAGAAACG TTAATTAAGC AAAATAAAAA ACTTGATGTA AAATTTATAT TTGTACAAGC AAAAACATCT GACAGTTTTG ATAATTCAGA GATAAGTAAC TTTTTGTCTT TTGTTAAAGT CTTTTTTTCT GATGAAGCTA AGAATACATT CAGTACAGAG GAAATGGCAG ACTTCATAGA AATGAAGGAT TTTATTTATA GTAATTCTCG CTATATGAAA GTCAAAAACC CGATAATACG ACTTTATTAT ATTGCTCCGG GGAAGTGGAA CGATGATGAT TCCAATTTGA AAGCTGTAAT TAATAGTCAT ATAGATACGC TTAATAACAT GGCACTTTTT TCATCTGTGG AATTTATACC TTGCGGTGCC CAAGAAATAC AGCGTATGTA TAGAAAATCA CAAGAGCAAA TAGAAGCGAC TTTTGTTTTT ACAAAAAATG TGATGATGTT TTCTGATGAT AATGGAGATT ATGGATATAG TGGGGTACTG CCATTCTGCG AATTTTATAA AATTATATGC GATGAAAATG GTTCACTAAA AAAAGTATTT GAAGATAATA TTCGGGACTT CCTTGGAGTG AATAATTATG TTAATGCGGA CATTGAAGAA ACTATTGTTG AAGGCAGAAA TAGCGCTTTT TGCATGCTAA ATAATGGAAT AACAATTGTC GCTCATTCTG CTGTGCTTGT GAGCGATAAG ATGACAATTT CAAACTATCA GATAGTTAAT GGATGCCAAA CCAGCCATGT TTTGTATCTT AACCGTGATA ATCTTGGAAT ACATGATTTA CTTATACCGA TTAAGATTAT TGTAACCAAG GATGAGGACT TAAAAAACCG TATTACAAAA GCTACAAATA ATCAAACTGG TATAACCAAA GAACAATTAG AAGCGTTATC AACTTTCCAA AAAACACTGG AGGAATACTA TCGCACATAC ACTGCTGAGG ATGAACGCTT GTATTATGAA CGTCGTTCAG GACAATATAG GAATGAATCG ATTCCCAAAG ATCGAATAGT TACTATTCGT GCCCAGTTAA AAAATGCATC ATCAATGTTC AATGATAAAC CACACGACGC TGCTGGTCAT TATAGTAGCT TATTGAAAGA TATTGGAAAC CGTATTTTTC TACCTGACGA CCAGCCTATA TTGTATTACA CAAGTTCTTT GGCCATGTTT CGTTTCGAAA ACCTGATAAA AACAAAATGT ATTGATAAAA AATACCGTAA AGGAAAGTAT CATGCCATAA TGCTTTTAAA GTATATGGCA ACAAACAACT TACCAAAACA TCATAGTGCC AAAAAAATGA TCAATGCTTG CAATCAAATT TTGCGTATCT TGAATGATTC AGGGAAATGT CTCGATTATT TTTTAAGAAT AATTGAATTC ATTGAAACAC AAAAAGAATT AGATTTGACG GATCGTAAAT TGTTTGAACG GAAAGAAACA ACAGATATTT TACTACAAAA TAAGGATAAG TTAATAAGAA GTTAA
Protein sequence | MNFLTKRVII PPNKIIIGGV TLDRITQAFL DEFSISHNFT MYETSVQFEH FANFCALSAE TGMVEIDIQD MHTGNATQGI DGIAIEVNGA IVCSIDEIET LIKQNKKLDV KFIFVQAKTS DSFDNSEISN FLSFVKVFFS DEAKNTFSTE EMADFIEMKD FIYSNSRYMK VKNPIIRLYY IAPGKWNDDD SNLKAVINSH IDTLNNMALF SSVEFIPCGA QEIQRMYRKS QEQIEATFVF TKNVMMFSDD NGDYGYSGVL PFCEFYKIIC DENGSLKKVF EDNIRDFLGV NNYVNADIEE TIVEGRNSAF CMLNNGITIV AHSAVLVSDK MTISNYQIVN GCQTSHVLYL NRDNLGIHDL LIPIKIIVTK DEDLKNRITK ATNNQTGITK EQLEALSTFQ KTLEEYYRTY TAEDERLYYE RRSGQYRNES IPKDRIVTIR AQLKNASSMF NDKPHDAAGH YSSLLKDIGN RIFLPDDQPI LYYTSSLAMF RFENLIKTKC IDKKYRKGKY HAIMLLKYMA TNNLPKHHSA KKMINACNQI LRILNDSGKC LDYFLRIIEF IETQKELDLT DRKLFERKET TDILLQNKDK LIRS
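The sequences above are printed in space-separated blocks of 10 letters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above, used only as a short demo.
demo = clean_sequence("TTGAATTTTT TGACAAAACG TGTTATAATT")
print(len(demo), round(gc_fraction(demo), 2))  # prints: 30 0.2
```

Passing the full Gene sequence value through the same two functions gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.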