Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1109 |
Symbol | |
ID | 4601103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1046553 |
End bp | 1047833 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773886 |
Product | major facilitator transporter |
Protein accession | YP_920511 |
Protein GI | 119720016 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGCA AATATAAATG GTTCGTGGTT CTGTTCTTCT TCACCTTCCT GACTATACAC CAGGCGGACC GCTTCATAGT CTCGGCTGTT GCGCCGCAAG TAATGGACGA GTTCAAGGTG TCGTACAGCC AGCTAGGTCT AGTATTCTCT CTTACGGTGC TGGTAGCCGC TTTCCTCTAC CCGGTGTGGG GCTACCTCTA CGACAGGTAC TCGCGGAAGC TCTTAGCCGG TCTAGCGGCG CTGATATGGG GTTTCACCAC CATCTTCAAC GCCCTCTCGA GAACCTTCTC CGAGTTCTTC GCTACGAGGC TCGCCACGGG TATCGACGAC GCGGCGCCTC CCGGAATTTA TAGCCTCGTG GCAGACTACT TTGACCCCTA CAGCCGCGGG AAGGCTCTCG GCTTGCTTAA CGCGACGGGG CCTCTCGGGG CGATCATAGG GACTATACTC TCGTTGAGCA TAGTGGCGGC GGGGCTTAGC TGGAGGAACG CGTTCTTCAT AACTGGTCCC ATAGGGGTTG CGATCGGCGC GTTAACCTTC TTCCTGGTAA AGGATGTGCC GAGGGGTGTT TCCGAGCCAG AGCTCAAGGA CGTGTTAACG GAGGATATCT ACAGGGCGAA GCTATCCGAC CTGCCAAAGG TGCTGGAGAA CAAATCCCTC GTACTCCTCT ACCTGCAGGG CTTCTGGGGC GTTTTCCCGT GGAACGCGAT TACCTTCTGG TTCGTGACGT ACATGGAGAA GGAGAGGGGG CTGTCCCCCG ACACCGTGAT GGTGGTAATG TCCCTGTCGC TCATCGCCAT GGTCGCAGGG AACATAGTCG CAGGGATTAT CGGAGACTGG TTGTTCAAAA AGACGAAGAG GGGTAGGGCG ATTCTCGGGG CGGTAGTAGT GTTCTTCTCG GCAGTGCTCA TCTACCTCGC GATTAGGGCT GAAAGCACGG AGGAGTTCAT TCTGTTCACT GTTCTCACAG CGTTCGAGAT TCCCATGGCG GCCCCGAACG TAGTCGCTGC CATCACGGAT GTCACCGAGC CCGAGCTGAG GTCCAGCGCT ACCGGATACC TAAGGTTCTT CGAGAACCTC GGTAGCGCTA CGTCGCCGTT TCTGACAGGC GTACTGGCGG AGTCCATGGG CCTTGGGGAG GCTATACTGC TGGTAAGCGT GTATACGTGG CTTCTGTGCT TCGTCTTCTT CGCGGTACTA GCAGCGATAA TCCCCCGCGA CATAGACAGG CTGAGAAACC TCATTAGGGA GAGGGCGGAG AGACTGAGGG GTGGGCGCTG A
|
Protein sequence | MRSKYKWFVV LFFFTFLTIH QADRFIVSAV APQVMDEFKV SYSQLGLVFS LTVLVAAFLY PVWGYLYDRY SRKLLAGLAA LIWGFTTIFN ALSRTFSEFF ATRLATGIDD AAPPGIYSLV ADYFDPYSRG KALGLLNATG PLGAIIGTIL SLSIVAAGLS WRNAFFITGP IGVAIGALTF FLVKDVPRGV SEPELKDVLT EDIYRAKLSD LPKVLENKSL VLLYLQGFWG VFPWNAITFW FVTYMEKERG LSPDTVMVVM SLSLIAMVAG NIVAGIIGDW LFKKTKRGRA ILGAVVVFFS AVLIYLAIRA ESTEEFILFT VLTAFEIPMA APNVVAAITD VTEPELRSSA TGYLRFFENL GSATSPFLTG VLAESMGLGE AILLVSVYTW LLCFVFFAVL AAIIPRDIDR LRNLIRERAE RLRGGR
|
| |