Gene Information |
Locus tag | Cthe_0853 |
Symbol | |
ID | 4810471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1027899 |
End bp | 1029584 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106269 |
Product | type II secretion system protein E |
Protein accession | YP_001037280 |
Protein GI | 125973370 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
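The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates (the listed values support this) and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and belong to no database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1027899, 1029584)
print(length_bp, protein_length(length_bp))  # 1686 561
```

The printed values match the Gene Length (1686 bp) and Protein Length (561 aa) fields.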
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000472298 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAC AAAAAAGAAA AGGTCTTGGC GACATTTTAG TGGAAGCCGG GCTTATTTCA AAAGAGCAGC TGGATAAAGC TTTAAAACTT CAGAAAAAAA CAGGCCAAAA ACTTGGAGTT TTGCTGGTTT CTGAAGGAAT TGTGACCCAG GAGGACATAA TGAGGGTCCT GGAGGAAAAA ATAGGTGTTT TACGTGTGGC ATTGGAAGAA TGCAACATTG ATCCCGCTGT TTGCAGCTTA ATCCCCGAAA AACTTGCCAG AAGGTATGAA TTGATTCCTA TAGCACAAAA AGACGGAGTT CTTAGGGTTG CCATGAGCGA TCCTTTAAAT GTTTTTGCCA TTGATGATAT TGAGGATTAT ACAGGTATGA GAGTTGAGCC TGTAGTTGAT TTTGCGTCGT CAATAAAAAA TGCCATTGAC AAATATTACA GAACACAGCA TGTTTTGGTG GAGCCTGTAA AGGAAAAAGG AATTTTATTT AAAATTGATG AGGAAACAAT AGAGCTTGAA AGCGTTGAGG CGGAAAATGA ATCTGCCTCA ATGCTTTTAA ATTCCATAAT AGAGCAGGCG ATAAGAAACG GGTCCGGAGA TATACATATT GAACCTTTGC AAAATGCATT AAAAATAAGG TTTAGAACCG ACGGACAAAT GCATGAGGTC ATGAGAACGG AAATTGGCAT GCTAAATGGT GTTTTGGCAA AGATAAAGGC AATTTGCGGT ATGAATATGA ACGAAAAGGC AGTTCCGCAG GACGGCAGGG TGAAGGTAAG TCTGGACGGA AGAGATTACA ATCTTAAGGT GTCGATTCTT CCGACCGTTT TTGGAGAGAA AATTGCAATC CGTATTGTTC ATAAAAAGAC TTCCGTCATT CCAAAAGAGC AGCTGGGAAT TTGTCAGGAG GACCTCGTAA AATTTGAGAG AATGATAAAA AGTCCTAAAG GATTGGTTTT GATAACAGGT CCTGAAGGAA GCGGCAAAAC CACAACTTTG TATTCCGCCG TAAGTGAAAT CAACAGTCCG AATGTACATA TAATTACCAT TGAAGACCCT GTTGAATACG TTATTGAAGG AGTAAACCAG GTACAGGTCA ACATGAAGAC AGGCCTGACT TATGAAAAAG GTCTAAGTTC AATTTTAGAA CAGGGACCGG ATGTAATTGT CATTGGGGAC ATAAAGGATG CGAAAACGGC TGAAATAGCT GTAAAGGCGG CAATGGGAGG GCATCTTGTA CTTGGAGCTT TTTGTGCCAA TGATACTTTG GACGCAGTGT TAACTCTTGT GGAAATGGGA ATAGATCCGT TTTTTATTGC ATCGTCCCTG ATAGGGGTAA TTTCTCAAAG GCTTGTGAGA AAAATTTGTC CCAACTGCAT AAAGAAGTAT GTTGCAACAG ATGAGGAACT TTCACTTCTT GAACTGGACA GACCCGTCGA ACTGTATTCG GGAAATGGGT GCGCAGAATG TTCCGGTACC GGATACAAAG GGAAATTGGG TGTTTTTGAG GTGCTGAATG TGGACAAGAG CTTCAGGGAT ATGATGAAGG AAAACTTTGC AAAGGAGAAA TTGAGAAAAT TTTGTGTTTT AAGGGGAATG AAAACTTTAA AAGAAAATGC AAAACAGCTT GTTCTTGAGG GAAAAACCAC TGCTTTTGAG ATGTCAAGAA TGCTGTCTTT TGAAGAAGAA TTATAA
|
Protein sequence | MQKQKRKGLG DILVEAGLIS KEQLDKALKL QKKTGQKLGV LLVSEGIVTQ EDIMRVLEEK IGVLRVALEE CNIDPAVCSL IPEKLARRYE LIPIAQKDGV LRVAMSDPLN VFAIDDIEDY TGMRVEPVVD FASSIKNAID KYYRTQHVLV EPVKEKGILF KIDEETIELE SVEAENESAS MLLNSIIEQA IRNGSGDIHI EPLQNALKIR FRTDGQMHEV MRTEIGMLNG VLAKIKAICG MNMNEKAVPQ DGRVKVSLDG RDYNLKVSIL PTVFGEKIAI RIVHKKTSVI PKEQLGICQE DLVKFERMIK SPKGLVLITG PEGSGKTTTL YSAVSEINSP NVHIITIEDP VEYVIEGVNQ VQVNMKTGLT YEKGLSSILE QGPDVIVIGD IKDAKTAEIA VKAAMGGHLV LGAFCANDTL DAVLTLVEMG IDPFFIASSL IGVISQRLVR KICPNCIKKY VATDEELSLL ELDRPVELYS GNGCAECSGT GYKGKLGVFE VLNVDKSFRD MMKENFAKEK LRKFCVLRGM KTLKENAKQL VLEGKTTAFE MSRMLSFEEE L
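The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (it differs in which start codons are permitted), and that the gene sequence is listed in coding orientation, as its leading ATG and trailing TAA suggest. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCAAAAAC AAAAAAGAAA AGGTCTTGGC"))  # MQKQKRKGLG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein under the stated assumptions.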