Gene Tpen_0445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0445 
Symbol 
ID4601868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp403135 
End bp404949 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content54% 
IMG OID639773212 
Producttype II secretion system protein 
Protein accessionYP_919857 
Protein GI119719362 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1955] Archaeal flagella assembly protein J 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.179017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGAG CATCGCTTGA CGGGATAGCG TACTCTTTGT TTGGCTGGGC TGGTGAAGCT 
ATAGCAAGGA TGGTTCCGAC CCTTAGAAGC GATATTGTCT CAGCGGACAT GAAGGTGTAT
CCTCCGGCGT ACGCGTCCAG AGTTGTTTTT CTGTCCATCA TAGCCCTGGT TCTAGGCTTC
GTGTTCGACG TGCCGATAGT GATTCTACTC TCGAGAATGG GGGTCGGCTT GGGAGGAGTC
CTGGCTACGA GCTTCTCGGT ACCCCTCATG CTCGCCGGCG CGGTATTCGG CCTCGGCCTC
GTATACCCCA AGATCAAGCT GTCGAACAGG ACAAGCAGGT TTGACCTGGA GGTCCCCTAT
CTTTCTGTCT ACATAACCGT CATGGCGACT GGAGGTATCT CGCCGTATAC GAGCTTCGAG
CGCCTAGCTA AGGCCCCGAA GGTGCTTTTC CAGGAGATAA AGAAGGAGGC GATGTACTTC
TTCCTGAAGG TTAAGGGAAT GGGCCAAGAT CCTCTCTCGG CGATAGAGGA CAGTGCTAAG
CGCGTACCGC ACAACGGCTA CAAGCAACTA ATGCTGGGTT ACGCGGCAAC TTTGAGGGCT
GGAGGCGACG TTGTCCACTA CCTTCAAAGG CAGACAGAGG TCATGCTACG CGAAAGAGTA
TCGCAGATGA AGACTATAGG TGAGAGAATA GGAGCCCTCA TGGAGTCCTA CATGGCTATA
GTGTTGCTCA CCTCTATGAC GCTCTACGTC CTCTACGTTG TGAACATGGC GCTAGCCCAG
GCCGGGATGG GTCTCGCCGG CGGAGAAATT CAGTTCGTAA TGGTGTCGTA CATCATTATG
CCGATGCTTT CGGGGCTCTT CATATACCTA GCCGATCTCA TGCAACCCAA GTACCCCGTG
TACGACTCAA CGCCGTATGT CGTGTACTTG GCCTTGGGAA TACCCATAAC AATCTTCCTC
TTCGTCGCCA TGGTCCTCCC ATTCGCGGTA GCGCCTCCAG CCTCTACTGT GCTAAGGAGC
GTTTTCTCAC CCTTCGTCGG CGCCGTGTTG CTCCTGACGA GGCTCTTAGG GCTTGGGAAG
GGATACGAGA GCGGAGTTGG CATGATAATC TCGCTGACCC TGGGGCTCCT CCCGGCGATG
ATCGTAGAAA CGCGGTCAAC ACTGAAGTTC AGCGGTATAC AGTACGGGTT AACGAGGTTT
CTAAGGGACC TCGTCGAGGT TAGGAAAACC GGTATGGCGC CGGAGAAGTG CATAATTAAC
CTGAAGGACA GGGACTACGG GAAGTTTACC CCCTACCTCC GCGACATAGC TAAGCAGGTC
GGGTGGGGTG TGTCCCTTCA CACGATCTTT GAGAGATTTT CGAAGGGCAT GAAGAACTGG
TTCGCCCTTA TCTCGATGTT TCTACTCGTT GAGAGCATCG AAGTAGGAGG CGGCACGCCG
CAGACGCTAG AGGCGCTTGC AAGCTACGCT GAGACGCTAG AGCAAGTCGA GAAGGAGAAA
AGGGCGGCTT TGAGGCCGCT GATGCTAATG CCCTATGTGG GAGCGCTCAT AATAACGGTT
GTAGTCCTGA TCCTAGTAGA GTTCATGGGT TCCATGCTTA AATTCGCGGG AACAGCGATC
TCGACGGAAC AGCTTGTAGG AATGTTCCTG CCCCCGGTGA TCATTAATAG CTACATGATG
GGGCTAGCTG CCGGAAAGAT AGGTTCCGAG AGGGTCGCGG CGGGTTTCAA GCACGCGATC
CTGCTACTAC TCGCAAACCT CGTAGCCATG ATGATCGCTC CACAAATCAC GGCGGGGATG
ATGCCTAAAC TCTAG
 
Protein sequence
MARASLDGIA YSLFGWAGEA IARMVPTLRS DIVSADMKVY PPAYASRVVF LSIIALVLGF 
VFDVPIVILL SRMGVGLGGV LATSFSVPLM LAGAVFGLGL VYPKIKLSNR TSRFDLEVPY
LSVYITVMAT GGISPYTSFE RLAKAPKVLF QEIKKEAMYF FLKVKGMGQD PLSAIEDSAK
RVPHNGYKQL MLGYAATLRA GGDVVHYLQR QTEVMLRERV SQMKTIGERI GALMESYMAI
VLLTSMTLYV LYVVNMALAQ AGMGLAGGEI QFVMVSYIIM PMLSGLFIYL ADLMQPKYPV
YDSTPYVVYL ALGIPITIFL FVAMVLPFAV APPASTVLRS VFSPFVGAVL LLTRLLGLGK
GYESGVGMII SLTLGLLPAM IVETRSTLKF SGIQYGLTRF LRDLVEVRKT GMAPEKCIIN
LKDRDYGKFT PYLRDIAKQV GWGVSLHTIF ERFSKGMKNW FALISMFLLV ESIEVGGGTP
QTLEALASYA ETLEQVEKEK RAALRPLMLM PYVGALIITV VVLILVEFMG SMLKFAGTAI
STEQLVGMFL PPVIINSYMM GLAAGKIGSE RVAAGFKHAI LLLLANLVAM MIAPQITAGM
MPKL