Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1453 |
Symbol | |
ID | 4600579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1403839 |
End bp | 1405086 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774228 |
Product | extracellular solute-binding protein |
Protein accession | YP_920853 |
Protein GI | 119720358 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0104666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAA AACAACAAAA ACAAGCCTCT AAGAAGACCC TGACAATAGT AGCAGCAGTT GTTCTAGTCC TCGTTGTACT GGGAGTAGCC GCCATGCTAC TAACGCAGAA ACCCTCCGCT CCTAAACGGA ACGTAACGAT CGTTATCTGG CACGCCATGG GTCCAGAGGA AGTTAAGACG CTCGAAGACG TCATCGCGGA CTTCCACGTT CAACACCCGG AGATCACCGT GAAGCTTGAG CAGAAAGCGG ACCTCGAGAC CTCCCTCAAA ACGGCCATCC CGGCTGGGCA GGGCCCAGAC CTCTTCATAT GGGCTCACGA CTGGATAGGT AAGTTTGCAG AGGCCGGCCT CCTCGAGCCG ATCGACGAGT ACGTGACTCC CAGCGTGCTG AACAAGTTCA GCCCGATAGG GCAGAACGCT ATAGAGTACC GCGGGCACTA CTACGCGATG CCCCTCGCCG CCGAGACGGT CGCCCTGATC TACAACAAGG CCTTGGTGCC TAACCCGCCG AAGACCTTCG ACGAGATGAA GAGCATAATG GCCAAGTTCA CTAACCCGGA CAAGGGTACG TACGGCCTTG CGACGCCGAT AGACCCCTAC TTCCTCTCCG GGTGGGTGCA CGCCTTCGGA GGCTACTACT TCGACGATAA GACCAAGCAG CCCGGGCTGG ACAAGCCTGA GACGATAAAG GGCTTCAAGT TCTTCTTCGA GCAGGTATAC CCCTACGTCG CTAAGACCCG CGACTACAAC GCGCAGGTAA GCCTCTTCCT CGAGGGCAAA GCCCCCATGA TGATCAACGG TCCTTGGAGC ATCGGCGACG TCAAGAAGGC TGGCATAAAC TTCGGCGTAG CCCCGTTGCC ACCGATAGAC AGCTCGAGCG TGCCGCACCC GTACGGCGGC GTGAAGCTGG TCTACGTAGC TAAGGGAGTT AAGGACAAGG CCGCGGTCTG GACGTTCCTC GAGTGGTTGA CCACGAATCC GAACGTCATC AAGCAGTTCG CCATACGCAA CGGCTATATC CCCGTGCTCA AAGAGGTCCT CAATGACCCG GAGATACAGA ACAACCCCGT GATCTACGGC TTCGGGCAGG CCGTCCAGAA CGCTATCCCG ATGCCTAAGA GCCCCGAAAT GGCGGCCGTC TGGGGACCCG TGGACACTGC CATCACGAAC ATCATGGGCG GAAAGCAGAG CATAGAGGCC GCACTGACAG CCGCGCAGCA GGAGGTTCTG TCCGCCTTGA AGAAGTAA
|
Protein sequence | MTEKQQKQAS KKTLTIVAAV VLVLVVLGVA AMLLTQKPSA PKRNVTIVIW HAMGPEEVKT LEDVIADFHV QHPEITVKLE QKADLETSLK TAIPAGQGPD LFIWAHDWIG KFAEAGLLEP IDEYVTPSVL NKFSPIGQNA IEYRGHYYAM PLAAETVALI YNKALVPNPP KTFDEMKSIM AKFTNPDKGT YGLATPIDPY FLSGWVHAFG GYYFDDKTKQ PGLDKPETIK GFKFFFEQVY PYVAKTRDYN AQVSLFLEGK APMMINGPWS IGDVKKAGIN FGVAPLPPID SSSVPHPYGG VKLVYVAKGV KDKAAVWTFL EWLTTNPNVI KQFAIRNGYI PVLKEVLNDP EIQNNPVIYG FGQAVQNAIP MPKSPEMAAV WGPVDTAITN IMGGKQSIEA ALTAAQQEVL SALKK
|
| |