Gene Tpen_1641 details


Gene Information

Locus tag: Tpen_1641
Symbol:
ID: 4600920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1589277
End bp: 1590737
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 60%
IMG OID: 639774414
Product: major facilitator transporter
Protein accession: YP_921039
Protein GI: 119720544
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
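
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1589277, 1590737)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```

Both values agree with the Gene Length and Protein Length fields of this record.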


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATACCG CTCACAAAAA GTATCTACTA ACGGCTCTTT CGATAACAAC GCTCGGAGCG 
TTCATGGCAG GCCTCGACGC GCGCATAGTA GTCGTCGGGC TAGACGTGAT AGCCTCAGCC
CTCAAAGCGG ACATCGAGGA AGCGCTCTGG TTCACGCAAG CATACATGCT TGGAAGCACG
CTCATGCTTC TACTCGCGGG GAAGCTCGCC GACCTCTACG GGAGGGTGAA GCTCTACGCA
TACGGCTTCC TGCTGTTCAC CCTGGGGTCA ATCCTCTCCG GAGCCGCCGC GACCCCCCTA
CAACTGGCCG CCTCGCGCTT CCTGCAGGGC CTCGGCGCGG GTGTTCTCAC AACTCTAAGC
GCGACTATAA TCACGGACGT CGCCGTCGGC GGACCCCTTG CCTTTGCGTT GAGCATAAAC
TCTCTAGCCT TCCGCCTCGG GTCAATCCTG GGCTTAACCG CTAGCGGGCT GATAATAGGC
CTTCTAGGTT GGAGGGGCAT CTTCTACGTC AACGTCCCCG TGGGCATAGC CGGGGCAATA
CTCTCCAGGA AGAGGCTCAG GGAGACCTAC ACGCCGCGCG AAAAACCCCT GATAGACTGG
GTCGGCTTCT GCCTTTTCAC GGTCTCGCTC CTAGGCCTCC TGCTAGCCTT AACGTTCTAC
GCCTACGGTC TCTCCTATAG GAGCTTCGCC CGCCTACTCC TACTGGTATC CGCAGCCTGC
TTCCTCCTCT TCATCGCCGT AGAGGCGAGG AGCGACCACC CGATACTCGA CTTATCCCTG
TTCAGGATCT GGGGCTTCAC GGGGGGAAAC ATAGCCCAGT TCCTGAACGC CGTTGCGTTC
GGCGCGGTCA TGCTGCTCTT AACGCTCTAC TACGAGGTCG CCCTGCGGAA AAGCGCTTTC
GAGACGGGGA TAAGCCTCCT CCCCTTCGAG CTCTCGTTCC TCGCGTTCGG GTTGCTGAGC
GGAAGGCTCT CCGACAGGTA CGGCTACGTC AAGTTCGCCA TACTGGGGCT ACTCGTGGGT
AGCCTCGCAC AGCTTCTACT CGGAGGCTTA ACGGTGAGCA CAAGCCCGGC GCTCGTAGCG
GCATACTCGG CACTGCTCGG GGCTGGGAAC GGGCTCTTCC TGTCGCCGAA CACGAGCGCG
ATAATGAGCT CTGTGCCCCC GGAGAGGAGG GGGGTCGCCT CGGCTATTAG AGCCATAGTC
TTCAACGTCG GGATGACTAT AAGCCTGAAC ATAGCGGTAA TACTTATCTC CACGAGGATC
CCCTACGAGA CGGTAACCCA GCTACTCGTA GGAGCAGAGC TAATGACCTC GGACACCGCT
ACCAGCAGGG CGCTCCTTGT CGACGCGATA GCCTACACGT TCAGGGTCCT CGCGCTGGTA
AACCTCTCGG CGGCCCTCTT CTCCTTCACG AGGCTGAAAG GAGGAAAAAC GGGCCGAGCC
CTCCCGGTCC TCGCCGAGTA A
 
Protein sequence
MDTAHKKYLL TALSITTLGA FMAGLDARIV VVGLDVIASA LKADIEEALW FTQAYMLGST 
LMLLLAGKLA DLYGRVKLYA YGFLLFTLGS ILSGAAATPL QLAASRFLQG LGAGVLTTLS
ATIITDVAVG GPLAFALSIN SLAFRLGSIL GLTASGLIIG LLGWRGIFYV NVPVGIAGAI
LSRKRLRETY TPREKPLIDW VGFCLFTVSL LGLLLALTFY AYGLSYRSFA RLLLLVSAAC
FLLFIAVEAR SDHPILDLSL FRIWGFTGGN IAQFLNAVAF GAVMLLLTLY YEVALRKSAF
ETGISLLPFE LSFLAFGLLS GRLSDRYGYV KFAILGLLVG SLAQLLLGGL TVSTSPALVA
AYSALLGAGN GLFLSPNTSA IMSSVPPERR GVASAIRAIV FNVGMTISLN IAVILISTRI
PYETVTQLLV GAELMTSDTA TSRALLVDAI AYTFRVLALV NLSAALFSFT RLKGGKTGRA
LPVLAE
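
The gene sequence begins with GTG while the protein begins with M. GTG is one of the alternative initiation codons allowed under translation table 11, and an initiation codon is read as methionine. The following is a minimal sketch of how the two sequences relate, not the database's own pipeline: the helper names are hypothetical, the set of start codons is a deliberately reduced subset chosen for illustration, and the amino-acid assignments are those of the standard code, which table 11 shares.

```python
# Standard codon-to-amino-acid assignments (shared by translation table 11),
# written in the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a reduced set of initiation codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for idx in range(0, len(dna) - 2, 3):
        codon = dna[idx:idx + 3]
        if idx == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGATACCG CTCACAAAAA GTATCTACTA"))  # MDTAHKKYLL
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate_cds("CTCCCGGTCC TCGCCGAGTA A"))           # LPVLAE
```

The two outputs match the first ten and last six residues of the protein sequence listed above.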