Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0821 |
Symbol | |
ID | 4810439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 997660 |
End bp | 999336 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106238 |
Product | coagulation factor 5/8 type-like protein |
Protein accession | YP_001037249 |
Protein GI | 125973339 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000172205 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA GGGTGTGCAT TCTTTTGGCT GTTGTCATTT TAATGACTGT CAGCCTGCCG TTTGTAACAC TGGGGGCGGA TGACATTTAT CCGGGACTTA GAGTTCAGGG GAGGTTTTTG TACGACAAAT ACGGTGAAAA AATAATACTG TATGGTGTGA ACGAAATGTC GATCTGGGGC GACATTGACG GTGATGTGGC ACTTCCTGAA ATCAGGAAAA CCGGAGCTAA TGCGGTAAGA CTTGTATGGT CAGTTTCCGG GCCGGCAAGG AAACTTGATA TTTTGCTTTA CAATTGCCGT ATAAACAATA TGATTCCTAT TATAGAGCTT CATGACGCAA CCGGTGAATG GCAGAAGCTT TCCATGCTCG TTGACTATTG GACAAGACCC GATGTGCTTG AAGTATTGAA AAAACATCAG GAATATCTCA TTATCAATAT TGGTAATGAA GTAGGAGCCC AGGTTTCGGA ATCTGAATTT AAAACAGGGT ATGAGGCTGC CGTAAAGAGA ATGAGGGAAG CTGGAATTCA TGTTCCTCTT ATGATTGACG GCAGCGACTG GGGAAAGAAT ATTGATATTC TTCAGGCAAC CGGACCTTAT CTTATAAACG CCGACCCTGA CAAAAACCTC TTGTTCTCGG TACATATGTG GTGGCCGTAC ATGTGGGGTT ACAATGAACA GAAAGTAATC GATGAAATAA AAGAATCTGT TGAAATGGGC TTGCCCCTTG TGGTGGGAGA ATTCGGACAC CAATGGGAAG AACATGAAAT GGGTAAAATC CCTTACAAAA CCATAATGGA GCAATGTTAC AAGAATGAAA TAGGTTACCT GGCATGGTCA TGGGGACCGG GCAACCAGCC GCAGACTTTC CTGGATATGA CCACCGACGG AACTTTTGAT ACATTGAGAG GATGGGGATT GGAAGTTTGC GTAACGGATC CTTACAGCAT AATGAATATT GCGGTTCGTC CGGCTTCAAT GCTTGAAGAG CCTACAACGG AGCCTCCGGT AATAAACATT CCCGCCGGAA GTATTGCGCA GAACAAGCCT GTATATGCAT CTTCGACGGA ACCAGGGCTG GGTAACACAC CGGAGAAGGC TGTTGACGGA AATATAGCAA CCAGATGGTC ATCGGATTAC AGCGATAATC AGTACATATA TGTTGACCTT CTGGATGAAT ATGAAATTGA AAGAGTGTAT ATTGAGTGGG AGGCTGCATA CGCCCGTCAG TACAAAATCC AGGTGTCCAA TGACGCCGTC ACATGGACCG ATGTATACAC CGAGTACAAC GGAGACGGAG ATATAGACGA TATTTACCTT GAAGCAAGGG GAAGATATGT AAGAATCTAT TGTATGCAAA GGGCAACTCA GTACGGAAAC TCCATATTTG AGTTGGGGGT TTATCCAAAG GGTGGAATTG CGGAACCGAC TCCTCCGGGG GTTGAAATTC TGGTGGGAGA TATAAACAGA GACGGCAAAA TTAATTCGAC GGACTTGGGT ATGTTGAACA GACATATATT GAAACTTGTA ATTCTTGATG ATAATTTAAA GCTTGCGGCA GCTGACATTG ACGGAAACGG AAATATAAAT TCCACTGATT ATTCATGGCT GAAAAAATAT ATATTAAAAG TAATTTCTGA ATTCCCGGGA GGGGATACGA GAATAGTGAC ACCATGA
|
Protein sequence | MKKRVCILLA VVILMTVSLP FVTLGADDIY PGLRVQGRFL YDKYGEKIIL YGVNEMSIWG DIDGDVALPE IRKTGANAVR LVWSVSGPAR KLDILLYNCR INNMIPIIEL HDATGEWQKL SMLVDYWTRP DVLEVLKKHQ EYLIINIGNE VGAQVSESEF KTGYEAAVKR MREAGIHVPL MIDGSDWGKN IDILQATGPY LINADPDKNL LFSVHMWWPY MWGYNEQKVI DEIKESVEMG LPLVVGEFGH QWEEHEMGKI PYKTIMEQCY KNEIGYLAWS WGPGNQPQTF LDMTTDGTFD TLRGWGLEVC VTDPYSIMNI AVRPASMLEE PTTEPPVINI PAGSIAQNKP VYASSTEPGL GNTPEKAVDG NIATRWSSDY SDNQYIYVDL LDEYEIERVY IEWEAAYARQ YKIQVSNDAV TWTDVYTEYN GDGDIDDIYL EARGRYVRIY CMQRATQYGN SIFELGVYPK GGIAEPTPPG VEILVGDINR DGKINSTDLG MLNRHILKLV ILDDNLKLAA ADIDGNGNIN STDYSWLKKY ILKVISEFPG GDTRIVTP
|
| |