Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0445 |
Symbol | |
ID | 4601868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 403135 |
End bp | 404949 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773212 |
Product | type II secretion system protein |
Protein accession | YP_919857 |
Protein GI | 119719362 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1955] Archaeal flagella assembly protein J |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.179017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCGAG CATCGCTTGA CGGGATAGCG TACTCTTTGT TTGGCTGGGC TGGTGAAGCT ATAGCAAGGA TGGTTCCGAC CCTTAGAAGC GATATTGTCT CAGCGGACAT GAAGGTGTAT CCTCCGGCGT ACGCGTCCAG AGTTGTTTTT CTGTCCATCA TAGCCCTGGT TCTAGGCTTC GTGTTCGACG TGCCGATAGT GATTCTACTC TCGAGAATGG GGGTCGGCTT GGGAGGAGTC CTGGCTACGA GCTTCTCGGT ACCCCTCATG CTCGCCGGCG CGGTATTCGG CCTCGGCCTC GTATACCCCA AGATCAAGCT GTCGAACAGG ACAAGCAGGT TTGACCTGGA GGTCCCCTAT CTTTCTGTCT ACATAACCGT CATGGCGACT GGAGGTATCT CGCCGTATAC GAGCTTCGAG CGCCTAGCTA AGGCCCCGAA GGTGCTTTTC CAGGAGATAA AGAAGGAGGC GATGTACTTC TTCCTGAAGG TTAAGGGAAT GGGCCAAGAT CCTCTCTCGG CGATAGAGGA CAGTGCTAAG CGCGTACCGC ACAACGGCTA CAAGCAACTA ATGCTGGGTT ACGCGGCAAC TTTGAGGGCT GGAGGCGACG TTGTCCACTA CCTTCAAAGG CAGACAGAGG TCATGCTACG CGAAAGAGTA TCGCAGATGA AGACTATAGG TGAGAGAATA GGAGCCCTCA TGGAGTCCTA CATGGCTATA GTGTTGCTCA CCTCTATGAC GCTCTACGTC CTCTACGTTG TGAACATGGC GCTAGCCCAG GCCGGGATGG GTCTCGCCGG CGGAGAAATT CAGTTCGTAA TGGTGTCGTA CATCATTATG CCGATGCTTT CGGGGCTCTT CATATACCTA GCCGATCTCA TGCAACCCAA GTACCCCGTG TACGACTCAA CGCCGTATGT CGTGTACTTG GCCTTGGGAA TACCCATAAC AATCTTCCTC TTCGTCGCCA TGGTCCTCCC ATTCGCGGTA GCGCCTCCAG CCTCTACTGT GCTAAGGAGC GTTTTCTCAC CCTTCGTCGG CGCCGTGTTG CTCCTGACGA GGCTCTTAGG GCTTGGGAAG GGATACGAGA GCGGAGTTGG CATGATAATC TCGCTGACCC TGGGGCTCCT CCCGGCGATG ATCGTAGAAA CGCGGTCAAC ACTGAAGTTC AGCGGTATAC AGTACGGGTT AACGAGGTTT CTAAGGGACC TCGTCGAGGT TAGGAAAACC GGTATGGCGC CGGAGAAGTG CATAATTAAC CTGAAGGACA GGGACTACGG GAAGTTTACC CCCTACCTCC GCGACATAGC TAAGCAGGTC GGGTGGGGTG TGTCCCTTCA CACGATCTTT GAGAGATTTT CGAAGGGCAT GAAGAACTGG TTCGCCCTTA TCTCGATGTT TCTACTCGTT GAGAGCATCG AAGTAGGAGG CGGCACGCCG CAGACGCTAG AGGCGCTTGC AAGCTACGCT GAGACGCTAG AGCAAGTCGA GAAGGAGAAA AGGGCGGCTT TGAGGCCGCT GATGCTAATG CCCTATGTGG GAGCGCTCAT AATAACGGTT GTAGTCCTGA TCCTAGTAGA GTTCATGGGT TCCATGCTTA AATTCGCGGG AACAGCGATC TCGACGGAAC AGCTTGTAGG AATGTTCCTG CCCCCGGTGA TCATTAATAG CTACATGATG GGGCTAGCTG CCGGAAAGAT AGGTTCCGAG AGGGTCGCGG CGGGTTTCAA GCACGCGATC CTGCTACTAC TCGCAAACCT CGTAGCCATG ATGATCGCTC CACAAATCAC GGCGGGGATG ATGCCTAAAC TCTAG
|
Protein sequence | MARASLDGIA YSLFGWAGEA IARMVPTLRS DIVSADMKVY PPAYASRVVF LSIIALVLGF VFDVPIVILL SRMGVGLGGV LATSFSVPLM LAGAVFGLGL VYPKIKLSNR TSRFDLEVPY LSVYITVMAT GGISPYTSFE RLAKAPKVLF QEIKKEAMYF FLKVKGMGQD PLSAIEDSAK RVPHNGYKQL MLGYAATLRA GGDVVHYLQR QTEVMLRERV SQMKTIGERI GALMESYMAI VLLTSMTLYV LYVVNMALAQ AGMGLAGGEI QFVMVSYIIM PMLSGLFIYL ADLMQPKYPV YDSTPYVVYL ALGIPITIFL FVAMVLPFAV APPASTVLRS VFSPFVGAVL LLTRLLGLGK GYESGVGMII SLTLGLLPAM IVETRSTLKF SGIQYGLTRF LRDLVEVRKT GMAPEKCIIN LKDRDYGKFT PYLRDIAKQV GWGVSLHTIF ERFSKGMKNW FALISMFLLV ESIEVGGGTP QTLEALASYA ETLEQVEKEK RAALRPLMLM PYVGALIITV VVLILVEFMG SMLKFAGTAI STEQLVGMFL PPVIINSYMM GLAAGKIGSE RVAAGFKHAI LLLLANLVAM MIAPQITAGM MPKL
|
| |