Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0727 |
Symbol | |
ID | 4600847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 676434 |
End bp | 677612 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773503 |
Product | type II secretion system protein E |
Protein accession | YP_920132 |
Protein GI | 119719637 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGCTTC CAAGGCTCTT CAGGGGGAAC GCGAAGCGCG AAGAGGAGCT TGTAAGCGTA AGTGAAGCTC TGCCGCCGGA CGCCAGGCTG GTAGACAAGA ACGAGTTCGG CGCTCTCTTC ATCAAGAACG ACACTTTACT GGTTCTGCCC TTCAGGCGGC ACGAGGTCTG GGGAGCTTTC GCGCCAATCT TTACAAGCAG TAAAGTGGAA GAAGCCTGGA TCTTCGAGGG TAGAGGGGTC GTCACGCTAC GCGGCGTCGG GAGGGTGAGC TTCCCGGCAG ACTCGGTCCC GGACTCCTTA GAAGAGCTTG TAACCAGAAT CATAGCGGCG AGCGGGGTGA ACGTCTCCCT GAGGAATCCG AGAGGCGTAG CTGACATCGG AGAGTGGCGC GTAGCGATAC AAGTCCAGTC AGGGGGTCAG CTCCACCTCG TCGCGACGCG AGTAACCAGC GTCCCTCCCC TCGAGGAGGT CGTCCCGCCC CTTCTCGCAA CTAGGCTCGT GCTACTGCTG GTTCGCCCAT CCACGGTCGT GATAACGGGG CCTCCGGGAA GCGGAAAAAC CACCGTGCTC TCCGCGCTTG TAAAGGCGGT CGCGGACTTT TTCCCCTGGC TGCACATCGC GATAGTCGAG AAGTACAGGG AGCTCAGTTT CAGAGGGGGC TGGTTCACGC AGGTAGTCTC GGAAAACCTA GCCGAGGGCG TTAGGTACGC TATGCGCTAC CTCAGACCAG ACCTCCTCGT TGTCGGGGAG ATACTGGCCG AAGACTTTTG GAGCCTACTA GAGCCGACTC GCGCGGGCAT CCCCACCGTG ACGACCTTCC ATGCGCCCTC GGCACAGAGA GCCCTCAAGT CTCTCTCGGA TGCCCTTAGG GTTCACCTGG GGGGAGCGAC GAACGTGCTC AACTACCTTG ACGTCCTCGT AGTAACGTCT AAGGTTATAA CGTCGAGCGG AGTCTCGAGG GGGGTAAGCG CGGTCTACCT TTCCGACGGG CAACGCCTAG TGCCGGTCTA CCTCGAAGGG GAGCACGCCG AGGAGGATCT CTTCAAAAAG GCTCTGCCCG ACAAGCTCTA CGTGGGCGAC CTAGTCAACG TAGAGAAGAC TTTAAGAGAG AGGTTCAGGC CGGACGAAGC CCAAGCGAGC TTCCTAAAGG AGCTCGCGGC GCTTTCCCTT GAACGTTGA
|
Protein sequence | MRLPRLFRGN AKREEELVSV SEALPPDARL VDKNEFGALF IKNDTLLVLP FRRHEVWGAF APIFTSSKVE EAWIFEGRGV VTLRGVGRVS FPADSVPDSL EELVTRIIAA SGVNVSLRNP RGVADIGEWR VAIQVQSGGQ LHLVATRVTS VPPLEEVVPP LLATRLVLLL VRPSTVVITG PPGSGKTTVL SALVKAVADF FPWLHIAIVE KYRELSFRGG WFTQVVSENL AEGVRYAMRY LRPDLLVVGE ILAEDFWSLL EPTRAGIPTV TTFHAPSAQR ALKSLSDALR VHLGGATNVL NYLDVLVVTS KVITSSGVSR GVSAVYLSDG QRLVPVYLEG EHAEEDLFKK ALPDKLYVGD LVNVEKTLRE RFRPDEAQAS FLKELAALSL ER
|
| |