Gene Information |
Locus tag | Tpen_1641 |
Symbol | |
ID | 4600920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1589277 |
End bp | 1590737 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774414 |
Product | major facilitator transporter |
Protein accession | YP_921039 |
Protein GI | 119720544 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
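
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the database.

```python
# Minimal consistency check for the Gene Information fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon

# Values copied from the table above (Tpen_1641).
length_bp = gene_length(1589277, 1590737)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) rows.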
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATACCG CTCACAAAAA GTATCTACTA ACGGCTCTTT CGATAACAAC GCTCGGAGCG TTCATGGCAG GCCTCGACGC GCGCATAGTA GTCGTCGGGC TAGACGTGAT AGCCTCAGCC CTCAAAGCGG ACATCGAGGA AGCGCTCTGG TTCACGCAAG CATACATGCT TGGAAGCACG CTCATGCTTC TACTCGCGGG GAAGCTCGCC GACCTCTACG GGAGGGTGAA GCTCTACGCA TACGGCTTCC TGCTGTTCAC CCTGGGGTCA ATCCTCTCCG GAGCCGCCGC GACCCCCCTA CAACTGGCCG CCTCGCGCTT CCTGCAGGGC CTCGGCGCGG GTGTTCTCAC AACTCTAAGC GCGACTATAA TCACGGACGT CGCCGTCGGC GGACCCCTTG CCTTTGCGTT GAGCATAAAC TCTCTAGCCT TCCGCCTCGG GTCAATCCTG GGCTTAACCG CTAGCGGGCT GATAATAGGC CTTCTAGGTT GGAGGGGCAT CTTCTACGTC AACGTCCCCG TGGGCATAGC CGGGGCAATA CTCTCCAGGA AGAGGCTCAG GGAGACCTAC ACGCCGCGCG AAAAACCCCT GATAGACTGG GTCGGCTTCT GCCTTTTCAC GGTCTCGCTC CTAGGCCTCC TGCTAGCCTT AACGTTCTAC GCCTACGGTC TCTCCTATAG GAGCTTCGCC CGCCTACTCC TACTGGTATC CGCAGCCTGC TTCCTCCTCT TCATCGCCGT AGAGGCGAGG AGCGACCACC CGATACTCGA CTTATCCCTG TTCAGGATCT GGGGCTTCAC GGGGGGAAAC ATAGCCCAGT TCCTGAACGC CGTTGCGTTC GGCGCGGTCA TGCTGCTCTT AACGCTCTAC TACGAGGTCG CCCTGCGGAA AAGCGCTTTC GAGACGGGGA TAAGCCTCCT CCCCTTCGAG CTCTCGTTCC TCGCGTTCGG GTTGCTGAGC GGAAGGCTCT CCGACAGGTA CGGCTACGTC AAGTTCGCCA TACTGGGGCT ACTCGTGGGT AGCCTCGCAC AGCTTCTACT CGGAGGCTTA ACGGTGAGCA CAAGCCCGGC GCTCGTAGCG GCATACTCGG CACTGCTCGG GGCTGGGAAC GGGCTCTTCC TGTCGCCGAA CACGAGCGCG ATAATGAGCT CTGTGCCCCC GGAGAGGAGG GGGGTCGCCT CGGCTATTAG AGCCATAGTC TTCAACGTCG GGATGACTAT AAGCCTGAAC ATAGCGGTAA TACTTATCTC CACGAGGATC CCCTACGAGA CGGTAACCCA GCTACTCGTA GGAGCAGAGC TAATGACCTC GGACACCGCT ACCAGCAGGG CGCTCCTTGT CGACGCGATA GCCTACACGT TCAGGGTCCT CGCGCTGGTA AACCTCTCGG CGGCCCTCTT CTCCTTCACG AGGCTGAAAG GAGGAAAAAC GGGCCGAGCC CTCCCGGTCC TCGCCGAGTA A
|
Protein sequence | MDTAHKKYLL TALSITTLGA FMAGLDARIV VVGLDVIASA LKADIEEALW FTQAYMLGST LMLLLAGKLA DLYGRVKLYA YGFLLFTLGS ILSGAAATPL QLAASRFLQG LGAGVLTTLS ATIITDVAVG GPLAFALSIN SLAFRLGSIL GLTASGLIIG LLGWRGIFYV NVPVGIAGAI LSRKRLRETY TPREKPLIDW VGFCLFTVSL LGLLLALTFY AYGLSYRSFA RLLLLVSAAC FLLFIAVEAR SDHPILDLSL FRIWGFTGGN IAQFLNAVAF GAVMLLLTLY YEVALRKSAF ETGISLLPFE LSFLAFGLLS GRLSDRYGYV KFAILGLLVG SLAQLLLGGL TVSTSPALVA AYSALLGAGN GLFLSPNTSA IMSSVPPERR GVASAIRAIV FNVGMTISLN IAVILISTRI PYETVTQLLV GAELMTSDTA TSRALLVDAI AYTFRVLALV NLSAALFSFT RLKGGKTGRA LPVLAE
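
The relationship between the gene sequence and the protein sequence can be illustrated by translating short fragments with translation table 11, as listed in the Gene Information table. The sketch below is a plain-Python illustration, not the tool that produced this record. It assumes the gene sequence is shown in coding orientation (it begins with GTG and ends with TAA, although the gene lies on the minus strand), and it reads an initiator codon such as GTG as M. The names `translate`, `head` and `tail` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons of NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    If is_start is True, a leading start codon is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 21 bases of the gene sequence above.
head = "GTGGATACCG CTCACAAAAA GTATCTACTA"
tail = "CTCCCGGTCC TCGCCGAGTA A"
print(translate(head))                  # MDTAHKKYLL
print(translate(tail, is_start=False))  # LPVLAE
```

The two outputs match the first ten and last six residues of the protein sequence shown above.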
|
| |