Gene Tpen_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1683 
Symbol 
ID4600571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1631173 
End bp1632420 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content67% 
IMG OID639774456 
Productmajor facilitator transporter 
Protein accessionYP_921081 
Protein GI119720586 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.249309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCGC GCCTACTCCC CGTCTTGATG GCTGGCTGGG CTATCGGCGC GATGTACGCG 
GGAGTGGTCA GCGGGACCTT GACGCTGATA AGGGCGGAGC TCGCGATGGG AGCGGAGGAG
GCCGGGAGGG TTCTGAGCAG CTGGCTCCTC GGGATGCTCC TCGGCGCCTC CCTGATCGGC
TACCTCTCCG ACAGGGTTGG GCGTAGGGCG TCCCTCGTCG CCTCCTACGC ACTCATGGGT
GTTTTCACCC CTATGAGCGC CGCGGCCCGG GGATGGCTGG ACCTATCGGT GTACAGGGTG
CTCGCGGGGG CCGGGAACGC GGGCTACATG GTCACGGCGA GCGTGCTCCT AGCGGAGTAC
GCGCCGACCT CTTCCAGGGG CAGGCAGGTC GCAGTGCTGG AGAGCGCGTG GGCGCTCGGG
TGGCTGGCAT CCCTCGTGCT ATCGCGGCTA ATCGCGCCCA GCTACGGCTG GCGCGCCGTC
TTCTTAGCGT CCAGCGCGGC GCTGGCACTG CTCCCTGTTC TCTACGTAGC CGTCCCGGAG
TCCCTCAGGT TCCTCGTGTC TAAGGGTAGG CTCGCCGAAG CCGAGGCCCT CGCGGGGAGG
CTCGGGGTAG AGGTGCCGCG GGCCCAGCCG GCCGCTAGGG CCGGGCTGAG GGACCTCGTC
GGGAGGAGGT ACCTGAGGAG GAGCGTGATG CTGTGGATCC ACTGGTTCAT GCTCGTACTC
GCGTACTGGG GGATCTTCCT CTGGCTCCCC GACATACTCT ACAAGCGCGG CATACCCTTC
GTCAGGAGCC TCGACTACGC GATACTGATA ACGCTAGCCC AGATACCGGG CTACCTGTCG
GGAGCCTACC TCGTAGAGGT CGCCGGCAGG AAGCCCGTCC TCGCCGCCTA CATGCTGGGC
GCGGGCATCG CGAGCGCCGG GCTGTGGGCC GCGAAGACAG ACCTCGAGGC CCTCGCGTGG
GGCGTGCTAG TCTCGTTCTT CAACCTCGGC GCCTGGGGCG TGACCTACGC CTACACTCCG
GAGCTGTACC CAACCGAGCT CAGGGGAACC GGTAGCGGGT GGGCAAACGC ATTCGGGAGG
ATAGGCGGGA TCCTCGGACC CTACGTGGCC GGCGTCATGA TACAGTCGTA CGGGAACCCC
TCGGCCCCCT TCGCCGTCTT CGCGCTGGCG CACGTGGTCT CCGCCGCCGT GGTCGCCGCG
CTGGGCGTCG AGACGAAGGG GAGAAGCCTC GAAGAGGTCT CGGCTTAA
 
Protein sequence
MKSRLLPVLM AGWAIGAMYA GVVSGTLTLI RAELAMGAEE AGRVLSSWLL GMLLGASLIG 
YLSDRVGRRA SLVASYALMG VFTPMSAAAR GWLDLSVYRV LAGAGNAGYM VTASVLLAEY
APTSSRGRQV AVLESAWALG WLASLVLSRL IAPSYGWRAV FLASSAALAL LPVLYVAVPE
SLRFLVSKGR LAEAEALAGR LGVEVPRAQP AARAGLRDLV GRRYLRRSVM LWIHWFMLVL
AYWGIFLWLP DILYKRGIPF VRSLDYAILI TLAQIPGYLS GAYLVEVAGR KPVLAAYMLG
AGIASAGLWA AKTDLEALAW GVLVSFFNLG AWGVTYAYTP ELYPTELRGT GSGWANAFGR
IGGILGPYVA GVMIQSYGNP SAPFAVFALA HVVSAAVVAA LGVETKGRSL EEVSA