Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1761 |
Symbol | |
ID | 4810191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2082568 |
End bp | 2083776 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107174 |
Product | hypothetical protein |
Protein accession | YP_001038175 |
Protein GI | 125974265 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000119528 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTC TTGAAAGTAT CAGACAGGCG CTGGACAGCC TTAAAGCAAA CAAGCTGAGA TCCATTCTTA CCATGCTGGG AATTGTCATG GGTGTGTTTT CGATTATTAC CATAATGGCG ATTGGAAATG CCACTGAAGA GTATATTAAC AGCCAATTTG AAAAAATCGG TGCCAATGTG CTTACCGTAG GCTACAAAAA TATGAATGTC GACAGTGATG AGATGCTGTA TCTTAAGGAT ATTGAAACAG TGAAAAGGGC TGCGCCGGAA ATAAAAAATG TTACAACTTA TATTCAGCAT AGGGGAACAC TGCGAATTGA TACAAAAACA AGAAGTGCTT TGGTGTATGG AACCACGGCC CAGTACAAGG ACATTACGCC TATGGAGATG GCTGCGGGAA GATTTTTCAC TGATTTTGAC ATATCTTCGA GACAGAAGGT TGTAGTTGTT GATGAATACT TTGCAAAAAG ATATTTCAAC AGGCTGGACA TAGTAGGTGA AGTCCTGCAA TTTAAAGCGC CTTCCGGGAA CTATAAGGTA AAGGTGATTG GGGTTGCAAA AGCTATTAAT GACGCCATGG CAAATTTATT GGACAATGAA AACTTTCCGA CTCAAATATA TATGCCCATT ACCACGGTTC AGCAAATGTA TTACAATAAT GAAGCTTTAG ACAGTATTCT TGTTGCGCTG GACAAGGAAG TAGACTATAA GGAAGTCGGG AACAGAATAG TAAGGGCTTT GGAGATGAGC AAAGGTAAGA AAGACATATA TATGACATAC AGTACACAGG ATTCTCAGGA AATACTTTCA AGTATAATCG GTGTTGTATC GGCGGTGCTT TTGGTCATTG CAATTATTAC CCTCATAGTC GGAGGTATAG GTATTATAAA CATATTGCTT GTTTCCGTGA CGGAAAGAAT AAGGGAAATA GGCATCAGGA AGGCATTGGG AGCCCAGAAA AAGGACATAA TTTTTCAGTT TATCACAGAA TCAATTATTA TGACGGGAAT AAGCGGCAGT ATAGGAATAT TTCTTGGAGT TTTGGGAGGC AATATAATTT CTCAGGCTAT TCAAATTCCG CCGGTGATTG ATGTTCCTGT AATTATCGGG GTATTTTTAG GGTCGGTGGT ATTGGGTCTC GTATTTGGTG TGTATCCTGC AAAGAAAGCG GCTGACCTGG ATCCTATAGA ATCTCTCAGG TATGAATAG
|
Protein sequence | MSFLESIRQA LDSLKANKLR SILTMLGIVM GVFSIITIMA IGNATEEYIN SQFEKIGANV LTVGYKNMNV DSDEMLYLKD IETVKRAAPE IKNVTTYIQH RGTLRIDTKT RSALVYGTTA QYKDITPMEM AAGRFFTDFD ISSRQKVVVV DEYFAKRYFN RLDIVGEVLQ FKAPSGNYKV KVIGVAKAIN DAMANLLDNE NFPTQIYMPI TTVQQMYYNN EALDSILVAL DKEVDYKEVG NRIVRALEMS KGKKDIYMTY STQDSQEILS SIIGVVSAVL LVIAIITLIV GGIGIINILL VSVTERIREI GIRKALGAQK KDIIFQFITE SIIMTGISGS IGIFLGVLGG NIISQAIQIP PVIDVPVIIG VFLGSVVLGL VFGVYPAKKA ADLDPIESLR YE
|
| |