Gene Tpen_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1653 
Symbol 
ID4601730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1600073 
End bp1601239 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID639774426 
Productmajor facilitator transporter 
Protein accessionYP_921051 
Protein GI119720556 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGGAGA AAGACGGGAA GCCGGGGATA AGGAACGTCT ACGCGCTGGG CTTCGTCAGC 
CTCTTCACCG ATGTTTCAAC GGAGATGATT AACGGCTACC TCCCCGTGTT CCTCGTCCAG
GAGCTTGGGG CTACCCGCGC GCTACTCGGG CTCATCGAGG GTATCGGAGA GCTCGCGAAC
TACTTCTTCA GGCTCGTGAG CGGGTACATA TCCGACAGAC TTGGACGTAG AAAAGCGCTC
GTGTTCGCCG GCTACTCCCT GAGCGCGGTC TCGAAGCCGC TGTTCGCGTT CGCCCATAGC
CCTTGGGACG CACTGGTCGT AAAAGCCCTC GACCGCACAG GTAAGGGTAT CAGAACCTCG
CCGCGTGACG CCTTGATAAG CCAGTCCATA GACGAGGAAT CCTCTGGGAA AGCCTTCGGG
CTTCACAGAA CGATCGACCA GAGTGGAGCC ATGATAGGTC CCCTCCTCGC AACGCTACTA
TTGCCGCTCA TAGGGCCGAG GAACCTCTTC CTAGTCTCCT TCATTCCCGC GGTGATCGCC
CTGGCTATAC TGCTAGCCTT CGTCGTAGAC GTGAAGACGG AGAGCAGGGA GGCCAGGATC
CTTAAAGGCG CAAGACAGGT GCTGGGCAAC AAGAGGCTGC TACTCGTGCT CGCCGCTTTC
GCCGTGATGG GCCTCGGATA CTACGACTTC TCCTTCCTGC TCGTGAGGTC AAGGGAGGTG
GGCGTAGCCG CCGACCTGGT ACCCCTAGTC TACCTTGCCA TAAACCTGTT CCACACGGCC
GTAGGCTACC CCTCAGGCGT ACTCTCGGAC AGGGTAGGCA AGGAGAAGGT CCTCGTAGGC
TCCCTAGTCT TCTTCGCCCT CGCGTCGATA GCCCTCGCGA GGACGGAGAA CCTTGCGGGC
TTCGCCGTAG CAGTCGTACT CTACGGGGTG TTCTTCGGGA GCTACGAGAC AGTCTCCAGG
GCTATCCTGC CGAGGTTTGC TCCTCCAGAG CTTCGGGGAA CCGTGTACGG CGTCTTCTAC
ATAGCCACGG GTCTCGCAAC CTTAGCCGGT ATGACCGTCG TAGGCTACCT GTGGGACACT
GCGGGCAGGG CGGTAGCGTT CACCTACAGC GCAACTCTCT CGCTGATCGC CGCGCTACTC
TTCGCTTACT CCACTCGCAT AAGCTAA
 
Protein sequence
MPEKDGKPGI RNVYALGFVS LFTDVSTEMI NGYLPVFLVQ ELGATRALLG LIEGIGELAN 
YFFRLVSGYI SDRLGRRKAL VFAGYSLSAV SKPLFAFAHS PWDALVVKAL DRTGKGIRTS
PRDALISQSI DEESSGKAFG LHRTIDQSGA MIGPLLATLL LPLIGPRNLF LVSFIPAVIA
LAILLAFVVD VKTESREARI LKGARQVLGN KRLLLVLAAF AVMGLGYYDF SFLLVRSREV
GVAADLVPLV YLAINLFHTA VGYPSGVLSD RVGKEKVLVG SLVFFALASI ALARTENLAG
FAVAVVLYGV FFGSYETVSR AILPRFAPPE LRGTVYGVFY IATGLATLAG MTVVGYLWDT
AGRAVAFTYS ATLSLIAALL FAYSTRIS