Gene Tpen_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0808 
Symbol 
ID4602139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp761956 
End bp763488 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content57% 
IMG OID639773584 
Producttype II secretion system protein E 
Protein accessionYP_920212 
Protein GI119719717 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTCAGC TGGTAGGTCT TCCTCGGCTT GAAGAGGGGG AGGTGCTCGT CGAGTACTAC 
CCCGTGACTC CCCCCTTCGC GTACGCTATG ATCGTGAAGA AGCGTGGGGG TAGCCTTGAG
TACAGGCTCG TGGAGCCCCC ACTTACCAGC GACGACGTGG AGAAGCTTGA AAGAATAAAG
AGGCTTCTAC TGGAGTACGC TCCTAGGAAG GTAGACTCCG CTCTAGGCGC TCTGGGTAGC
GCCCCGGAGA AGTACCTCGA AGACGAAGTG AACTACGTGA TTAAGAAGCA CAAGATACCC
GTGCCTCCCG AGGCCCTAGA CAAGTACATG TACTACATCA AGAGGGACAC GCTCGGCTAC
GGGAGGATAG ACGCTCTTCT AAGGGACGTG GAGCTGGAGG ATATATCCTG CGACGGGCTA
GGTACACCCG TCTACGTTTG GCACAGGAGG TACGAGTCGC TCCCCACGAA CATAGTGTTC
AAGGATCCCG GCGAGCTCGC CAGCCTTATA CTCAAGCTCA GCTTCCGCGC CGGGAGACAG
ATATCCGTCG CGCAACCGAT AGTGGAGGGC TCTCTCCCGA TGGGCTTTAG GCTCCACGCG
ACGCTGGAAG AGGTATCCCG CAGGGGAGGG ACGTTCACTA TAAGGAAGTT CAGGGAAGTT
CCATTCACCA TAGTCGACCT CGTAGCCGCG GGGACAGTTT CGGAGGAGCT CGCAGCCTAC
CTCTGGTACC TCGTGGAGAA CTATAGGAGC GTGCTAGTAG TTGGCGCGAC CGCGGGCGGG
AAGACGACGA CCCTGAACGC TATAGCCACA TTCATACGCC CGGAGGCGAA GATAGTCACG
ATAGAGGATA CGCCCGAGCT GAGGCTACCC CACGAGAACT GGGTACCGCT GGTAACGCGT
CCCAGCCACG AGGAGTGGGT TCGCAACGTC GACCTCTTCG ACCTCTTGAA GAGCGCTATG
AGGATGAGGC CCGACTACCT GATAATAGGC GAGATAAGAG GGGAGGAGGC CTTCACGCTC
TTCCAAGCGA TAGCCACAGG GCACTCGGGT ATGAGCACGC TCCACGCCGA GAGCATAGAC
TACGCTGTGA AGCGCCTCGT ATCGGAGCCC ATGAAGGTCC CGCTCTTCCT CCTACCCATG
ATGAACGTCT ACATACTCAT CAAGAGGTTA AAAATCGGGG ACCGAATAGT CAGGAGGGTT
GTGAGCGTGC AGGAAGCCCT AGGCATAGAC GAGTCCGCGA AAACAGTGGT TTTCAGGGAA
GTCTTCAGGT ACAACCCTGT CACAGCCAGG ATAGAGAGGT CCGGCGAGAG CGAGATGCTG
AGGAGGATAG CCGAGGAGAG GTACATACCT CTGGGCGACG TATACCAGGA GATCGCCAGG
AGGAAGGCGA TAATAGAGAC ACTCGTCAGG AACAATATAC GCCGCTACGA GGAGGTAAGC
AGGGTGGTTC GCGACTACTA CAACAACCCG GAGGCCACGT ATCACCAGCT GATGAGCGGG
ACGTACGTCT TCACGCCTTT AAGGGCTCGA TAG
 
Protein sequence
MSQLVGLPRL EEGEVLVEYY PVTPPFAYAM IVKKRGGSLE YRLVEPPLTS DDVEKLERIK 
RLLLEYAPRK VDSALGALGS APEKYLEDEV NYVIKKHKIP VPPEALDKYM YYIKRDTLGY
GRIDALLRDV ELEDISCDGL GTPVYVWHRR YESLPTNIVF KDPGELASLI LKLSFRAGRQ
ISVAQPIVEG SLPMGFRLHA TLEEVSRRGG TFTIRKFREV PFTIVDLVAA GTVSEELAAY
LWYLVENYRS VLVVGATAGG KTTTLNAIAT FIRPEAKIVT IEDTPELRLP HENWVPLVTR
PSHEEWVRNV DLFDLLKSAM RMRPDYLIIG EIRGEEAFTL FQAIATGHSG MSTLHAESID
YAVKRLVSEP MKVPLFLLPM MNVYILIKRL KIGDRIVRRV VSVQEALGID ESAKTVVFRE
VFRYNPVTAR IERSGESEML RRIAEERYIP LGDVYQEIAR RKAIIETLVR NNIRRYEEVS
RVVRDYYNNP EATYHQLMSG TYVFTPLRAR