Gene Information |
Locus tag | Cthe_1765 |
Symbol | |
ID | 4810009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2087115 |
End bp | 2088266 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107178 |
Product | hypothetical protein |
Protein accession | YP_001038179 |
Protein GI | 125974269 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000291236 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAG TCCTGTCCGC AATAATCATT ATTTTTATGT TAGCTTTCAT GCAGGCCTAT GCGGCTAATG ACAGCAGGCT GTCCTATGAC ACGGCAAAAG AAGTCATGCT GAAAAACAGC AGGGCCGTGG CAAAGCAGAA GCTGTCGGAA AGGAAAGCCT TTTATCAGTA CAACGGTGTG GTGCAGCGTA CCCGGGGAAT TGAGACGGAG ATGACTGTCA TAGATACTCC TATGGGAAAA TACTATTATG TTTATCCTCC GAATATACAG GTGCTTCTGA CCAAACAGGC TGAACTTCTG CCCCTTCAGA TGAGATATTA CTGGAGAATG GCCGACAACG GCAGGATTGT TACCGAAAAG GCACTTTCAC TGGGGCTTCG GGATTTGTAC CTCGGATTTA TGAAATCCGA CATGGATTAT CGGCTGAGTT TGGAGAAGCT CGAGCTTCAG GAAAAGAAAT ACAATGCCGC AAAACTTAAA GCTGAAAAAG GGCTGATTTC AGGGATTGAA CTTGAAGAAG CGGAGTATGA TTACCTGAAA GCAAAAAAAG ATGTTGAAAA ATATAAAAGA AGCCGGGAGA ACATGCAAAG GAGCATAAAC TCCTACATAG GTGTGCCAAT TGACACGGCC TATGATAAAG TATTATTTTC CGAATATACA AGAAATCTTG TGGTAAAACC TTTGGAATAT TATACCGAGG CGGCATTGGA GAACAGGCTT GAGATAATTT CCGTAGCGGA GGAGATAAAG ACAAAAGAAA AGCATTTGGA GATCCTTGAA ATAGGCAGGG CAAAGGATAT ATATCCTGAC ATACGCAAAG AGTATGAAGA TGTTTTGCTG GAAATAGAGA CTTTGAAAGT CAAGCTTGAG AAAGCCAGGT ATGATGTTGA AAACAACATA AAGTCTGCGT ACATAGATGT AATAAAAGAA AAAGACAACA TGGATAATTT GATGGCAACG TTGAATATGC AAAAAAGGAA TTTTGAAAGA CTTAAAGCCC GGTATGAGCA GGGTTTCATA CCTGAAACGG TAATTGAAGA AATGGAGCTT GCAATCGAGG AATTGCAAAA CGGAGTTAAT CTTACGGTTT ACAACTATAA TACGAAAAAA ATGAAGCTTG AAGAGGCGGC GGGCTTGGGT CCGGCGTATT AG
Protein sequence | MKKVLSAIII IFMLAFMQAY AANDSRLSYD TAKEVMLKNS RAVAKQKLSE RKAFYQYNGV VQRTRGIETE MTVIDTPMGK YYYVYPPNIQ VLLTKQAELL PLQMRYYWRM ADNGRIVTEK ALSLGLRDLY LGFMKSDMDY RLSLEKLELQ EKKYNAAKLK AEKGLISGIE LEEAEYDYLK AKKDVEKYKR SRENMQRSIN SYIGVPIDTA YDKVLFSEYT RNLVVKPLEY YTEAALENRL EIISVAEEIK TKEKHLEILE IGRAKDIYPD IRKEYEDVLL EIETLKVKLE KARYDVENNI KSAYIDVIKE KDNMDNLMAT LNMQKRNFER LKARYEQGFI PETVIEEMEL AIEELQNGVN LTVYNYNTKK MKLEEAAGLG PAY
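
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: Start bp and End bp are 1-based inclusive coordinates, and the gene length includes the terminal stop codon (the gene sequence above ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters; spaces are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length_bp = gene_length(2087115, 2088266)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result matches the Gene Length and Protein Length fields (1152 bp and 383 aa). `gc_fraction` can be applied to the gene sequence string as displayed, since it skips the spaces between the 10-base blocks.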