Gene Tpen_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1109 
Symbol 
ID4601103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1046553 
End bp1047833 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content57% 
IMG OID639773886 
Productmajor facilitator transporter 
Protein accessionYP_920511 
Protein GI119720016 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAGCA AATATAAATG GTTCGTGGTT CTGTTCTTCT TCACCTTCCT GACTATACAC 
CAGGCGGACC GCTTCATAGT CTCGGCTGTT GCGCCGCAAG TAATGGACGA GTTCAAGGTG
TCGTACAGCC AGCTAGGTCT AGTATTCTCT CTTACGGTGC TGGTAGCCGC TTTCCTCTAC
CCGGTGTGGG GCTACCTCTA CGACAGGTAC TCGCGGAAGC TCTTAGCCGG TCTAGCGGCG
CTGATATGGG GTTTCACCAC CATCTTCAAC GCCCTCTCGA GAACCTTCTC CGAGTTCTTC
GCTACGAGGC TCGCCACGGG TATCGACGAC GCGGCGCCTC CCGGAATTTA TAGCCTCGTG
GCAGACTACT TTGACCCCTA CAGCCGCGGG AAGGCTCTCG GCTTGCTTAA CGCGACGGGG
CCTCTCGGGG CGATCATAGG GACTATACTC TCGTTGAGCA TAGTGGCGGC GGGGCTTAGC
TGGAGGAACG CGTTCTTCAT AACTGGTCCC ATAGGGGTTG CGATCGGCGC GTTAACCTTC
TTCCTGGTAA AGGATGTGCC GAGGGGTGTT TCCGAGCCAG AGCTCAAGGA CGTGTTAACG
GAGGATATCT ACAGGGCGAA GCTATCCGAC CTGCCAAAGG TGCTGGAGAA CAAATCCCTC
GTACTCCTCT ACCTGCAGGG CTTCTGGGGC GTTTTCCCGT GGAACGCGAT TACCTTCTGG
TTCGTGACGT ACATGGAGAA GGAGAGGGGG CTGTCCCCCG ACACCGTGAT GGTGGTAATG
TCCCTGTCGC TCATCGCCAT GGTCGCAGGG AACATAGTCG CAGGGATTAT CGGAGACTGG
TTGTTCAAAA AGACGAAGAG GGGTAGGGCG ATTCTCGGGG CGGTAGTAGT GTTCTTCTCG
GCAGTGCTCA TCTACCTCGC GATTAGGGCT GAAAGCACGG AGGAGTTCAT TCTGTTCACT
GTTCTCACAG CGTTCGAGAT TCCCATGGCG GCCCCGAACG TAGTCGCTGC CATCACGGAT
GTCACCGAGC CCGAGCTGAG GTCCAGCGCT ACCGGATACC TAAGGTTCTT CGAGAACCTC
GGTAGCGCTA CGTCGCCGTT TCTGACAGGC GTACTGGCGG AGTCCATGGG CCTTGGGGAG
GCTATACTGC TGGTAAGCGT GTATACGTGG CTTCTGTGCT TCGTCTTCTT CGCGGTACTA
GCAGCGATAA TCCCCCGCGA CATAGACAGG CTGAGAAACC TCATTAGGGA GAGGGCGGAG
AGACTGAGGG GTGGGCGCTG A
 
Protein sequence
MRSKYKWFVV LFFFTFLTIH QADRFIVSAV APQVMDEFKV SYSQLGLVFS LTVLVAAFLY 
PVWGYLYDRY SRKLLAGLAA LIWGFTTIFN ALSRTFSEFF ATRLATGIDD AAPPGIYSLV
ADYFDPYSRG KALGLLNATG PLGAIIGTIL SLSIVAAGLS WRNAFFITGP IGVAIGALTF
FLVKDVPRGV SEPELKDVLT EDIYRAKLSD LPKVLENKSL VLLYLQGFWG VFPWNAITFW
FVTYMEKERG LSPDTVMVVM SLSLIAMVAG NIVAGIIGDW LFKKTKRGRA ILGAVVVFFS
AVLIYLAIRA ESTEEFILFT VLTAFEIPMA APNVVAAITD VTEPELRSSA TGYLRFFENL
GSATSPFLTG VLAESMGLGE AILLVSVYTW LLCFVFFAVL AAIIPRDIDR LRNLIRERAE
RLRGGR