Gene Tpen_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0439 
Symbol 
ID4601862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp397210 
End bp399225 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content54% 
IMG OID639773206 
Producttype II secretion system protein E 
Protein accessionYP_919851 
Protein GI119719356 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTAGCG AGGTTAAGGT TAAGCAGGAG AAGAAAAAAC CCGCGAAAAA GAAGGAGAAA 
AAAGCCATAG CGGAGACCTC CGAGAAAAAG GAGGAGCCCG TTGTTCTGCT CCCGGTCCAG
GAGTCCTACA GGGTTATCGA CGAGTACTGG GTTGTAGAGC CTTTCGCGAA GGTGAAGATA
GTCGAGATTC CCGAAGCTGG CAACCAGCGT GCCTACTTTG TGGAAGAGGT GCAGCTAAGC
GAGGCTGAGA GGAAGGCTGT CAATAAACTT ATAGACATTC TAAGCGTTGA GCTCGAACCG
CCCGCCTCTT TTGACGTCGA GTTACGGGAG CACGTTGTGG CCGAGGCAAG AAGGCTCGCG
GAGAAGTATA GAAGCGCGGT GAGAGGGCTA TCCGAGGAGA GCTGGAAAAA GGTCATCTAC
TACATCGAGA GGGACCTGGT CGGGTATGGT CCCATAGAGG TCTTAATGCG TGACTACAGG
CTGGAGGATA TAAGCTGTGA CGGTGTAGAT AGGGCTATAC ACGTGTGGCA CAGGGACTAC
GAAAGCATAC CGACAAACAT CGTGTTTAGA AGCAGAGATT ACCTGAGAGA GTTTATAGTT
AAGCTGGCCC ACATGGCCGG GAAGCATATC TCGGCGGCGT TCCCGATAGT CGACGCGATG
CTCCCGGGTA GGCACAGGCT GGCCGCGACG TACGGAGAGG AGGTATCGCC GCGAGGCAGT
ACTTTTACCA TTAGGAAGTT CAGAGAGAAA CCTCTCTCGA TAGTCGAGAT AATTAGTTCG
GGCAACCTCG ACTCGTGGAG CGCGGCCTAC CTGTGGCTAA TGATCGAGAA CAGGATGACA
GCCATGGTGA TAGGAGCAAC GGCCGCCGGT AAGACCACGC TCCTCAACGC GATAGCGAAC
TTCTTCAAGC CCGGCTTCAA GATAGTAACG ATAGAGGAAA CCCCAGAGCT CAACCTCCCC
CACGAGAACT GGGTGCAGCT CGTGAGTAGG GAGAGCTACG GTCTAGGCGA ATCGAAAGTG
GGGGAGATTA CGCTCTACGA CCTTGTAAAG GTCTCCCTCA GGTACAGGCC CGACTACATA
ATAGTCGGAG AAGTCAGAGG CGAAGAGGCC TTCGTACTGT TTCAAGCGAT GTCCACAGGG
CACGGAGGCA TGTCGACCAT GCACGCCGAC TCGCTGGATA GGGCCGTAAA GAGGCTGACA
AGCCCGCCCA TGAACGTCTC GCCGTCCTAC ATACCGTCGC TCAACATAGC GCTTCTCTCC
GAGAGGACGA TTCTCCCCGA CGGGAAGTTC GCTCGTAGAG TCAAGCACAT CTGGGAGATC
GAAGACTACG AGAAGTACCG GGAAATAGTG AAGTGGGATC CAGTTAAAGA CGTTCACGTT
GTGGTATCCG AAAGTCACCA TATACGCTTC ATCGCAGAGA AGCTCGGCAA AAGGGTCGAG
GATTTATACT CCGAGATTGA ACGTAGAAGG CTCGTACTCG AATGGATGAG AATGAAGAAT
ATAAAGGAAA CGAGGGATGT GTTCACCGTT ATCAACAAGT ACTACGTATA CCCCGAAGAG
GTCTACGCGG AGGCACGGAG AGAGCTCGAA GAGAAGGGTG CAATACCAGC ACCCGTAAAG
ATAAGGCCCT CGGTGGTCGT AGAGGTAGCA CCGAAGCCTG TACGCGCGCT TCCAGCGGTA
CAGACTAGAG TAAGTACGCA CGCCACCTCC GAGGATGCGA AGCCTGTTCC TCAGCAACCT
GTGGCTCTCC AACCGCCAGT ACCGCTGTCT GTGGGGCTGA GCCAGGAGAG CTTCCTGGTG
TTGAGGACTT TGGCGTCTCT GGGAGGGGAG GCGGACCACG AGTCCTTAGT CTCGATGGTG
AACCTTCCTA GGGAGGTTTT CAGCAGGGCT ATAAGCGAGC TAGCCGGGAA GCGTCTCCTA
GTTCCCACGC TACTGGTAGA AGGCGGGAAG CCCCTCATAG GCTACAAGAT TACGAGCGAC
GGAGAAAAAA CCCTGAAAAC GATCTCTCAG CAGTAA
 
Protein sequence
MSSEVKVKQE KKKPAKKKEK KAIAETSEKK EEPVVLLPVQ ESYRVIDEYW VVEPFAKVKI 
VEIPEAGNQR AYFVEEVQLS EAERKAVNKL IDILSVELEP PASFDVELRE HVVAEARRLA
EKYRSAVRGL SEESWKKVIY YIERDLVGYG PIEVLMRDYR LEDISCDGVD RAIHVWHRDY
ESIPTNIVFR SRDYLREFIV KLAHMAGKHI SAAFPIVDAM LPGRHRLAAT YGEEVSPRGS
TFTIRKFREK PLSIVEIISS GNLDSWSAAY LWLMIENRMT AMVIGATAAG KTTLLNAIAN
FFKPGFKIVT IEETPELNLP HENWVQLVSR ESYGLGESKV GEITLYDLVK VSLRYRPDYI
IVGEVRGEEA FVLFQAMSTG HGGMSTMHAD SLDRAVKRLT SPPMNVSPSY IPSLNIALLS
ERTILPDGKF ARRVKHIWEI EDYEKYREIV KWDPVKDVHV VVSESHHIRF IAEKLGKRVE
DLYSEIERRR LVLEWMRMKN IKETRDVFTV INKYYVYPEE VYAEARRELE EKGAIPAPVK
IRPSVVVEVA PKPVRALPAV QTRVSTHATS EDAKPVPQQP VALQPPVPLS VGLSQESFLV
LRTLASLGGE ADHESLVSMV NLPREVFSRA ISELAGKRLL VPTLLVEGGK PLIGYKITSD
GEKTLKTISQ Q