Gene Information |
Locus tag | Cthe_0629 |
Symbol | |
ID | 4808158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 776050 |
End bp | 777495 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106043 |
Product | type II secretion system protein E |
Protein accession | YP_001037057 |
Protein GI | 125973147 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
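The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon. Both are assumptions, not something the record states, although the listed values are consistent with them. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 776050
end_bp = 777495
gene_length_bp = 1446
protein_length_aa = 481

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: a complete CDS has one codon per residue plus one stop codon.
codons, remainder = divmod(gene_length_bp, 3)

span_ok = span == gene_length_bp
frame_ok = remainder == 0
protein_ok = codons - 1 == protein_length_aa

print(span_ok, frame_ok, protein_ok)  # True True True
```

With these values the span is 1446 bp and the gene holds 482 codons, which is 481 residues plus the stop.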
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATCGA GAAGTACTGT AAGTGAACTT CAAAAGAAAG CAAAGTATAT ACATAGAACC AATACTTCAG GAAGCAAGCT CACATCCATG ACTTTTGAAG AATGTCTAAA GCAGTGTCAG CAGTATATAA GCAATGTGGC GACAAATTAT TACAGAAGAA TTGAAGACCC GGCCAAAAAG AGGGAAATGA CGGAAAGGTA TATAATTGAC TATGTTGAAA CACAGAAGCC TGAAGTGGAA GGATTTGAGG AACTTTCGGC CTTAAGAAAT GCCCTTATTG ATGAGATTAC CCAATACGGT CCGATAACAG ATGCGATGGA AGACCCTCTA ATAGACGAAA TCAGAGCCAA TGGTCCGCAG CAGATATTTG TAGAGAAAAA GGGTAGAACC GAGATTTGGG ACAAATGCTT TACCGACAGG GAGCATATGG AAAGAATAAT TGCAAAGCTT ATTGGAGTAT CCAAGGTGAG GCTTACCCCA AGAACGCCAA TGGTAAATGC CCGTACCATA GAAGGCTACA GGGTCAATGC CACCCATGCG GATATTTCAC CTTACGGCAA TCCGGCGTTT GTGGTAAGAA AGTTTAAAAA GTACACTCTG ACTCCGGAAG AGATGATAAA GGAAAAATCA TTTTCAGTTA ATATGTACAA GCTTCTTTCG CTTATACCCA AGGCCAATCT TTCCTGGATG ACGGCCGGGC CTACCGGCAG CGGAAAGACC ACTTTGAATG AAGTTTTGAT AAAGCATATT GATCCGCTGT GTCGTATTAT AACAATTGAA AACCCTGCAG AGTACAGGCT GCTAAGGTAC AAAAACGATG ACCCCAATGA AAGGGTTATA AATGACGTGC TTCAGTATGA GTGTGTTCCT GATGATGACG ACAGCAGTCC GGCAACAATG GAAAATTTGT TGATAAATGC CATGAGACAG TCTCCTCACT GGATAGGTCC CGGTGAGCTT CGTTCTCCGG GAGAGTTTGA GACTGCCCTG CGCGCTGCGC AAACCGGTCA CTATTTCTTT ACAACCCTGC ATGCGGAAGG AGACCAGGAA GCAATTTACA GGTTCCTTAC TGCTTACCTT ATAAAATCAA AAGAACCTGC GGAACTGGCC CTGAGAAATA TATGTACCGC CGTTAAATTT GTCATATTCC AGGAAAAGCT GGCCGACGGA ACCCGTAAAG TTACATCAAT TTCTGAAGTC AGAGGTTCTG AAGGCTTAAA GCCTATTATT AACCAGATAT ACAAATTTAT TCCTGAGGAT GTTGATGAGG ATCCTGTTAC AGGGCAAATC AGAGCAATAG TAGGCAAGCA CAAGAGAGTA GGCCGTATAT CGGATGAGCT TGTTGACAGA ATGATGAAAG CAGGAATAAA AAAGAGTAAG TTTGAATTCC TCACGCGTCC GGTTGACCCC GATGAAGAGG AGGTTTACGA TATAGATGCT CTTTAA
|
Protein sequence | MLSRSTVSEL QKKAKYIHRT NTSGSKLTSM TFEECLKQCQ QYISNVATNY YRRIEDPAKK REMTERYIID YVETQKPEVE GFEELSALRN ALIDEITQYG PITDAMEDPL IDEIRANGPQ QIFVEKKGRT EIWDKCFTDR EHMERIIAKL IGVSKVRLTP RTPMVNARTI EGYRVNATHA DISPYGNPAF VVRKFKKYTL TPEEMIKEKS FSVNMYKLLS LIPKANLSWM TAGPTGSGKT TLNEVLIKHI DPLCRIITIE NPAEYRLLRY KNDDPNERVI NDVLQYECVP DDDDSSPATM ENLLINAMRQ SPHWIGPGEL RSPGEFETAL RAAQTGHYFF TTLHAEGDQE AIYRFLTAYL IKSKEPAELA LRNICTAVKF VIFQEKLADG TRKVTSISEV RGSEGLKPII NQIYKFIPED VDEDPVTGQI RAIVGKHKRV GRISDELVDR MMKAGIKKSK FEFLTRPVDP DEEEVYDIDA L
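The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of doing that and then translating or computing GC content. The function names are hypothetical, the input is assumed to contain only A, C, G and T, and the codon map is the standard one, which translation table 11 shares for elongation (table 11 differs mainly in which codons may act as starts).

```python
# Standard codon-to-residue mapping in TCAG order. Assumption: this is adequate
# for translation table 11, which uses the same residue assignments.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: RESIDUES[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
gene_excerpt = clean_sequence("ATGTTATCGA GAAGTACTGT AAGTGAACTT")
print(translate(gene_excerpt))  # MLSRSTVSEL
```

The translated excerpt matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.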