Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1485 |
Symbol | |
ID | 4601280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1434834 |
End bp | 1436246 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639774260 |
Product | extracellular solute-binding protein |
Protein accession | YP_920885 |
Protein GI | 119720390 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0472403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AACTACTCGG ATTTGGAATC CTAACAATAC TGGTCCTAGC AGTCGTCAGC GTTGCAAGCG CAAGCGTGCA AACCCCGCCT CTAAATGTAG AAACTCAGCA GTACGAAAAC ACTCTAGTCC TGATAACGCC TAGCAATACG CTGCTTTTAA AGGCTGTCAC AACAGCGTTT AAGAACTATG CTAAGAACAA GCTCGGAGTG GACGTAGACG TAAAGCTCAT ACAGGCGGGA TCCCCTGAAT GCATGAACAG GATTATAGCC TGGAACGGGA AACCCGAGGC GGACATCTTC TTCGGAGGAG ACCTCATATA CCACTGGAAG CTGAAAGAGA AGGGTCTCCT AGAAAGCTAT AAGCCCAAGT CTCCGGGCTA CGACGCTATT CCGTCCACGT TCCTCGGCTT CCCACTCAAA GACCCCGATG AAATGTGGCA CCCCAAGCTC TGGTGGGGGC ACGGCTTCAT GTACAACACT AAAGTTCTCG AGAAGCTCGG ACTTCAACCC CCGAAGACCT GGGACGACCT GCTAGACCCC AAGTGGAAGG ACCTGATAGT CATGTGTACA CCGTCTAGGT CGTCGTCCAC GTACATCAAC GTCGGAATAA TAATCCAGAA CAGAGGGTGG GATCAGGGAT GGGCCTTTTG GAGGAGGCTC GCCGCCAACG TAGGCGCCTT CGTGCAGAGA AGCGCGGACG TCGTAGACTT GGTCTCTAAA GGTGAGTATG CCGTAGGCTT TGCATACAGC CAGGGGGCCA TTATCAACAG GTACAACGGA TACCCCGTTT CGATGTACAT GGATCCCACG GGTTTCATAG TTTCAGGTGT ATCCCTACTC AAGGGAGCGC CTCACCCGAA CATCGCTAAA GCCTTCCTCG ACTGGTGGTA CACAGAGGAC GCCCAGCAAG CCGCTCTAAG CGCGGGAGGA ATACCCGTGC TTCCCACAGT CAAGATAGCC GGACCTCCAA ACTCCACGGC TGCTATCCTC AGGGAGTACC TCGGAGGAAA GGACACGATA TACGACTACC TCAAGACCTT TACCAACGTG AAGTTCTACA ACTTCACCTT CGCTGAAGCT ATGTACACGA ATATCAGTAA GATTTTCGAC AACACGATAG TCGCTAAGCA CTCGGAGCTG AAGGACGCTT GGAGCACTAT TCTAAGCGTT CAGGCAAAGG TAAAAGGAGT GCCGGACGCG GAAGCCAAGC TGGCCCAGGC TATAAGTGCG TTCGACAAGG GAGACTACGC CACTGCCAAG GCGCTAGCTC TGGAAGCCGC GAAAATAGCG GAGACAGCGC CTAAAGGGCC TTCCGCCACG GAGATCCTGG TCTACGCGGT CATCGCCCTA GTCGTGATAG CTGCACTTGC ATACCTGTTA CGCGGAAAGC TAGGCAAGAA GGCTAAGCAG TAA
|
Protein sequence | MNKKLLGFGI LTILVLAVVS VASASVQTPP LNVETQQYEN TLVLITPSNT LLLKAVTTAF KNYAKNKLGV DVDVKLIQAG SPECMNRIIA WNGKPEADIF FGGDLIYHWK LKEKGLLESY KPKSPGYDAI PSTFLGFPLK DPDEMWHPKL WWGHGFMYNT KVLEKLGLQP PKTWDDLLDP KWKDLIVMCT PSRSSSTYIN VGIIIQNRGW DQGWAFWRRL AANVGAFVQR SADVVDLVSK GEYAVGFAYS QGAIINRYNG YPVSMYMDPT GFIVSGVSLL KGAPHPNIAK AFLDWWYTED AQQAALSAGG IPVLPTVKIA GPPNSTAAIL REYLGGKDTI YDYLKTFTNV KFYNFTFAEA MYTNISKIFD NTIVAKHSEL KDAWSTILSV QAKVKGVPDA EAKLAQAISA FDKGDYATAK ALALEAAKIA ETAPKGPSAT EILVYAVIAL VVIAALAYLL RGKLGKKAKQ
|
| |