Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1107 |
Symbol | |
ID | 4811405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1316762 |
End bp | 1319125 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106529 |
Product | type II secretion system protein E |
Protein accession | YP_001037532 |
Protein GI | 125973622 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E [TIGR02538] type IV-A pilus assembly ATPase PilB |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAGGGG TAGATACAAG AGGTATTGAT GAAATACTTT TGGAAATGGG AGTTTTAAAA ATAGTTGACC TTAAAAAAGC CTGGGATATT CAGAGGGAAA GCAATAAGAA CATTGAGGAC GTGCTATTGG AACTCGGTCT TGTATCCCAG AAAGACATAA TGCATGCCAA TGCCGTGAAG ATGGGTATTC CTTTCGTCGA TCTGTCAACT TATCAAATCA GCGATTCCAG TGTTCCTTTA TTAATTACAC GAAACATAGC AAACCGGTAC AAGGTTATAC CCATAGAAAA GGAAAATGGT GTTTTGACAG TTGCAATGAG TGATCCGACG GATATTTTCT GTATAGATGA CATACGTCTT GCAACTGCTT TGGAAATAAA GCCTGTCCTT GCGGATGTCA AGGAGATAGA AAGACTTATT GTTGAATATT TCGGAGAAGA AAAGAAGCCG CAGGAATCGA AATTAAAAGC GGAAAATGAG GAGCAGAACA AGAAAGAAGA ACTTTTAAAA ATGGAGGAAG AACTTCTTGG AAGAGAGATA TATAATAACA TAAAAGCCGA TGTTGAGACG AGAGAGCCGG AATTTGATTT GGCAAGCAAG GGATTTGATC AGAATACTTA CAACGAAAGC GGTATTTTTA AAGATAAAAT AGGAAACATC CTTGTACGTG CGGGAGTTAT TACACAGGAT CAACTGGAAA ATGCCCTCAG TATACAGAAA AAATCTGGCG GACTTATCGG TCAGATTTTA GTAAAGCAAG GCTATATTGA CAGAAGGTCT CTTTATGAAT TTCTCCAAAA ACAGATGGGA GTCGAATACG TTGATATTGA GGGAATTGAA ATTGATGAGG ATATCATTGG TTTGGTATCC CCGAATTTGG CAAAAACCCA CAAGGTAATC CCTATTGAAA AAGTGGACGG GAATTTAAAG GTTGCGATGA GCGACCCGAT GAACATATTT TCAATTGACG ATTTGAGGCT TACAACCGGC CTTGAAATTA TTCCTTGCCT TGCCGATGAA GAGCAGATTT CGGCACAGTT GGAAAAATAC TACGGAAAAG CTTCCAGGAA GACCAGTGCG AAAGAAATAG AGCAGAAGGT TGCGGATCTT GACGAGGAAA TTAAGAAAGT AAATGAAAAA ATTGCGGTTG AAATAACTCA AACTGAAGAT GAAGATACAA CAATTGACAT TAGCGATCTT GAAAATGCCC CCATTGTCAA GATGGTTAAT ATTATCTTCC AGAAAGCCGT GGCTACAAGA GCAAGTGATA TTCACATAGA ACCCCAGGAA GATTGTGTTT TAATAAGATT CAGAATAGAC GGACAACTGG TAGAGATAAT GAGATATGAC AGAAAGATTC TTTCTTCAAT TGTTGCCAGA ATAAAAATCA TCAGCGGTCT GAATATTGCG GAAAAGAGAA TTCCTCAGGA CGGAAGAATA GGAATAAAAA TTGACGACAG GGAGTATGAC ATGCGTGTTT CCGTTCTGCC TACAATGTTC GGAGAAAAAG TTGTTATAAG AATTGCCGAC AAGGAAGGCT TTAATGTTTC GAAGAAAGAG TTGGGATTCT TTGAAGATGA TTTGGAGAAA TTTGACCAGA TAATATCAAG TCCTTACGGT ATTATACTGG TTACCGGACC TACCGGAAGC GGTAAGTCAA CAACGCTTTA TACCGCGCTG AGGGAATTGT GCAAGCCTAA TGTCAACATA CTTACTGTTG AGGATCCTGT TGAAAGTACT ATAAAGGGTA TAAATCAGGT ACAGGTGAAT GTAAAAGCTG GTTTGACATT TGCTACGGCA CTGAGAGCGT TCTTAAGACA GGACCCCGAC ATAATAATGG TGGGAGAGAT ACGTGACTCG GAAACAGCTG AAATAGCAAT CCGTGCAGCT ATCACGGGAC ACCTTGTTTT CAGTACATTG CACACTAACG ATGCTGCAAG TTCTGTTACA AGAATGATTG ACATGGGAAT AGAGCCGTTC CTGTTGTCTT CGGCTCTGGT TGGATTGATT GCGCAAAGAC TTGTAAGAAG ACTCTGTCCT CATTGCAAGG AAGCTTTCCA GCCGGATAAG AATGAGAGAG AGATTCTTGG CTTGAAAGAT GATGAAGAAG TTACAATATA TCGTGCAAAA GGGTGCGACG AATGTAACAA TACCGGTTAC AAAGGAAGAA TAGCGGTTTA TGAGATTCTG ACGGTCAATA GGGAAATAAA GGAACTGATA TCCAAGAACG TAAGTTCCGA TGTAATAAAA GATGCGGCTA TTAAAATGGG CATGAAAACA TTGAGAATGA ACTGTACGAG GTTGGTTAAA GAAGGTATTA CTACGATAGA TGAGATGCTT CGTATTGCAT ATTCCATAGA CTAG
|
Protein sequence | MLGVDTRGID EILLEMGVLK IVDLKKAWDI QRESNKNIED VLLELGLVSQ KDIMHANAVK MGIPFVDLST YQISDSSVPL LITRNIANRY KVIPIEKENG VLTVAMSDPT DIFCIDDIRL ATALEIKPVL ADVKEIERLI VEYFGEEKKP QESKLKAENE EQNKKEELLK MEEELLGREI YNNIKADVET REPEFDLASK GFDQNTYNES GIFKDKIGNI LVRAGVITQD QLENALSIQK KSGGLIGQIL VKQGYIDRRS LYEFLQKQMG VEYVDIEGIE IDEDIIGLVS PNLAKTHKVI PIEKVDGNLK VAMSDPMNIF SIDDLRLTTG LEIIPCLADE EQISAQLEKY YGKASRKTSA KEIEQKVADL DEEIKKVNEK IAVEITQTED EDTTIDISDL ENAPIVKMVN IIFQKAVATR ASDIHIEPQE DCVLIRFRID GQLVEIMRYD RKILSSIVAR IKIISGLNIA EKRIPQDGRI GIKIDDREYD MRVSVLPTMF GEKVVIRIAD KEGFNVSKKE LGFFEDDLEK FDQIISSPYG IILVTGPTGS GKSTTLYTAL RELCKPNVNI LTVEDPVEST IKGINQVQVN VKAGLTFATA LRAFLRQDPD IIMVGEIRDS ETAEIAIRAA ITGHLVFSTL HTNDAASSVT RMIDMGIEPF LLSSALVGLI AQRLVRRLCP HCKEAFQPDK NEREILGLKD DEEVTIYRAK GCDECNNTGY KGRIAVYEIL TVNREIKELI SKNVSSDVIK DAAIKMGMKT LRMNCTRLVK EGITTIDEML RIAYSID
|
| |