Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0808 |
Symbol | |
ID | 4602139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 761956 |
End bp | 763488 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773584 |
Product | type II secretion system protein E |
Protein accession | YP_920212 |
Protein GI | 119719717 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTCAGC TGGTAGGTCT TCCTCGGCTT GAAGAGGGGG AGGTGCTCGT CGAGTACTAC CCCGTGACTC CCCCCTTCGC GTACGCTATG ATCGTGAAGA AGCGTGGGGG TAGCCTTGAG TACAGGCTCG TGGAGCCCCC ACTTACCAGC GACGACGTGG AGAAGCTTGA AAGAATAAAG AGGCTTCTAC TGGAGTACGC TCCTAGGAAG GTAGACTCCG CTCTAGGCGC TCTGGGTAGC GCCCCGGAGA AGTACCTCGA AGACGAAGTG AACTACGTGA TTAAGAAGCA CAAGATACCC GTGCCTCCCG AGGCCCTAGA CAAGTACATG TACTACATCA AGAGGGACAC GCTCGGCTAC GGGAGGATAG ACGCTCTTCT AAGGGACGTG GAGCTGGAGG ATATATCCTG CGACGGGCTA GGTACACCCG TCTACGTTTG GCACAGGAGG TACGAGTCGC TCCCCACGAA CATAGTGTTC AAGGATCCCG GCGAGCTCGC CAGCCTTATA CTCAAGCTCA GCTTCCGCGC CGGGAGACAG ATATCCGTCG CGCAACCGAT AGTGGAGGGC TCTCTCCCGA TGGGCTTTAG GCTCCACGCG ACGCTGGAAG AGGTATCCCG CAGGGGAGGG ACGTTCACTA TAAGGAAGTT CAGGGAAGTT CCATTCACCA TAGTCGACCT CGTAGCCGCG GGGACAGTTT CGGAGGAGCT CGCAGCCTAC CTCTGGTACC TCGTGGAGAA CTATAGGAGC GTGCTAGTAG TTGGCGCGAC CGCGGGCGGG AAGACGACGA CCCTGAACGC TATAGCCACA TTCATACGCC CGGAGGCGAA GATAGTCACG ATAGAGGATA CGCCCGAGCT GAGGCTACCC CACGAGAACT GGGTACCGCT GGTAACGCGT CCCAGCCACG AGGAGTGGGT TCGCAACGTC GACCTCTTCG ACCTCTTGAA GAGCGCTATG AGGATGAGGC CCGACTACCT GATAATAGGC GAGATAAGAG GGGAGGAGGC CTTCACGCTC TTCCAAGCGA TAGCCACAGG GCACTCGGGT ATGAGCACGC TCCACGCCGA GAGCATAGAC TACGCTGTGA AGCGCCTCGT ATCGGAGCCC ATGAAGGTCC CGCTCTTCCT CCTACCCATG ATGAACGTCT ACATACTCAT CAAGAGGTTA AAAATCGGGG ACCGAATAGT CAGGAGGGTT GTGAGCGTGC AGGAAGCCCT AGGCATAGAC GAGTCCGCGA AAACAGTGGT TTTCAGGGAA GTCTTCAGGT ACAACCCTGT CACAGCCAGG ATAGAGAGGT CCGGCGAGAG CGAGATGCTG AGGAGGATAG CCGAGGAGAG GTACATACCT CTGGGCGACG TATACCAGGA GATCGCCAGG AGGAAGGCGA TAATAGAGAC ACTCGTCAGG AACAATATAC GCCGCTACGA GGAGGTAAGC AGGGTGGTTC GCGACTACTA CAACAACCCG GAGGCCACGT ATCACCAGCT GATGAGCGGG ACGTACGTCT TCACGCCTTT AAGGGCTCGA TAG
|
Protein sequence | MSQLVGLPRL EEGEVLVEYY PVTPPFAYAM IVKKRGGSLE YRLVEPPLTS DDVEKLERIK RLLLEYAPRK VDSALGALGS APEKYLEDEV NYVIKKHKIP VPPEALDKYM YYIKRDTLGY GRIDALLRDV ELEDISCDGL GTPVYVWHRR YESLPTNIVF KDPGELASLI LKLSFRAGRQ ISVAQPIVEG SLPMGFRLHA TLEEVSRRGG TFTIRKFREV PFTIVDLVAA GTVSEELAAY LWYLVENYRS VLVVGATAGG KTTTLNAIAT FIRPEAKIVT IEDTPELRLP HENWVPLVTR PSHEEWVRNV DLFDLLKSAM RMRPDYLIIG EIRGEEAFTL FQAIATGHSG MSTLHAESID YAVKRLVSEP MKVPLFLLPM MNVYILIKRL KIGDRIVRRV VSVQEALGID ESAKTVVFRE VFRYNPVTAR IERSGESEML RRIAEERYIP LGDVYQEIAR RKAIIETLVR NNIRRYEEVS RVVRDYYNNP EATYHQLMSG TYVFTPLRAR
|
| |