Gene Information |
Locus tag | Cthe_1339 |
Symbol | |
ID | 4809479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1629675 |
End bp | 1630913 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106763 |
Product | type II secretion system protein E |
Protein accession | YP_001037764 |
Protein GI | 125973854 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
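The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption, though it reproduces the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in the Protein Length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cthe_1339
length = gene_length_bp(1629675, 1630913)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```

Both results match the Gene Length and Protein Length rows of the record.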
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCTGG AAGAAAAAGA GAAGCTTATT GCTCAGATAA GAAAGCATAT AAGCGAAAAC CTGGATTTGA GAAAGGACTT TTCGGATGAA GAAATAAAGG ACATTATTAC AAATGTTGTT TTTGAAAGGT CAAGGGATTA TTACCTCAGC GTGGGGGAGA AGAAGGAAAT TGCCGATGCG ATTTTTAATT CCATGAGAAG GCTGGATGTT CTTCAACCCC TCATTGATGA CAAAAGTATT ACTGAAATAA TGATAAACGG CCCGGACTCA ATATTTATTG AAAGGGACGG AAGAGTCTCA AAATTGAACG TAAAATTTGA AAGTCGACGC AAGCTGGAGG ACGTAATTCA GACTATTGTA TCAAGGGTGA ACAGGACGGT AAATGAGGCG TCTCCGATTG TTGATGCCAG GTTGCCGGAC GGTTCCCGTG TAAACGTGGT TTTACCGCCG ATAGCTTTAA ACGGGCCTGT GGTTACCATA AGAAAGTTTC CGGAAAAACC GATGACGATA GAGCAGCTTA TAAAATACGG TTCAATCACC GAGGAAGTTG CTGAAGTGCT GGAGAGGCTG GTTAAAGCAA AATATAATAT ATTTATCTGC GGAGGTACGG GCTCGGGAAA AACCACATTT TTGAATGCTC TTAGCAATTT TATTCCGAAG GACGAGAGAA TTGTTACAAT AGAAGACTCG GCAGAGCTTC AGATTACCGG GGTGGAGAAC ATTGTCAGGC TGGAAACAAG GAATGCCAAT ACGGAGGGAA AGGGAGAGAT TACAATCAGG GATCTCATAA GAACTTCACT TCGTATGAGG CCGGAGAGAA TTATTGTGGG TGAGGTGCGT GGAAAAGAGG CACTCGACAT GCTTCAGGCG ATGAATACCG GACATGACGG TTCCCTTTCC ACCGGACACG CAAACTCCAC AAAGGACATG CTTTCAAGGC TTGAAACCAT GGTGCTGAGC GGTGCCGAAA TGCCTTTGGA GGCTATCAGA CAACAAATAG CTTCTGCGAT AGATATAATT ATTCATTTGG GAAGGCTCAG GGATAAATCG AGAAGAACCC TTGAGATTAC AGAGGTTGTG GAATACAAGA ATGGGCAGAT TGTTCTAAAT CCGCTTTATG AGTTTGTCGA AGAGGGGGAA ACTCCGGAAA AACAGGTGAT TGGCACCTTG AGAAGAACAA AGAATGAAAT GGTGAACAAA CTCAAATTCA AGATGGCGGG TATATCCGAC AAGTTTTAA
|
Protein sequence | MELEEKEKLI AQIRKHISEN LDLRKDFSDE EIKDIITNVV FERSRDYYLS VGEKKEIADA IFNSMRRLDV LQPLIDDKSI TEIMINGPDS IFIERDGRVS KLNVKFESRR KLEDVIQTIV SRVNRTVNEA SPIVDARLPD GSRVNVVLPP IALNGPVVTI RKFPEKPMTI EQLIKYGSIT EEVAEVLERL VKAKYNIFIC GGTGSGKTTF LNALSNFIPK DERIVTIEDS AELQITGVEN IVRLETRNAN TEGKGEITIR DLIRTSLRMR PERIIVGEVR GKEALDMLQA MNTGHDGSLS TGHANSTKDM LSRLETMVLS GAEMPLEAIR QQIASAIDII IHLGRLRDKS RRTLEITEVV EYKNGQIVLN PLYEFVEEGE TPEKQVIGTL RRTKNEMVNK LKFKMAGISD KF
|
| |
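The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The displayed gene sequence already begins with ATG, so it appears to be given in coding orientation even though the gene lies on the minus strand; the sketch below therefore translates it as shown and does no reverse complementing (an assumption). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, so one lookup table serves here. All names in the code are illustrative.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above
first_blocks = "ATGGAGCTGG AAGAAAAAGA GAAGCTTATT"
print(translate(first_blocks))  # MELEEKEKLI
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.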