Gene Tpen_1257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1257 
Symbol 
ID4600423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1193429 
End bp1194832 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content60% 
IMG OID639774033 
Productextracellular solute-binding protein 
Protein accessionYP_920658 
Protein GI119720163 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCAGC AGGCAACCAA GAAGAAAGTT TCGCCGCTTC TAATCGCCGG AATCCTGGTA 
GTCATAGTGG TCCTCGTCGC CGCGGCGTTC CTGCTCTACA AGCCTCCAGC CGCCCCCACT
AAGCCCTCGA AGATAACGTT CTACACGTGG TGGGCCGGGC TCGAGAGGTT CGCCATAGAC
GCGCTTATCG GCAACTTCAC GAAGAATACG GGGGTTGCTG TTGAAAAGAC GGCGGTACCC
GGAGGCGCGG GGGTAAACGC TAAGTACGCC ATCATAGCGC TGATAATGGC CGGGAAGCCC
CCGGAGGCCT TCCAGGTACA CTGCGGGCCC GAGATGATTA GCTACTTCAT GGCGGCGCCA
CACGGAAAAG ACGACTTCGT CGACCTGACC TCCGTCGGGC AGGAGATAGG CCTCACGGCG
ACTCCTCCCG GGCAGGTGTG CATGCTGAGC GGGCGCCTCT ACACGCTCCC AGTGAACCTC
CACAGGGCTA ACCTGATCTT CATGAACAAG CAGGTACTCG ACAAGTACGG CGTAAAGCCT
CCGACCACGA TCGACGAGCT GAACGCCGCT TGCAGCAAGC TCAAGGCGGC GGGAGTACCG
TGCCTCGTGC AGGCAGGAGC GGACCAGTTC ACGGTGCTAC ACCTCTGGGA GCAGATATTC
CTCGCCGTGG CGGGACCCGA TAAGTTCATA AAGTTCATGT ACGGGACCCT CGACCCCAAC
GACCCCAGCA TAACCCAGGC CACCCAGATA TTCCTCGGCT ACGTTGACAC GTTCCCGCCG
GACTGGATGG CCCTCGACTG GACCTCCGCG GTAGACAGGG TGGTAAAGGG CATGGGAGCC
TTCCACGTGG ACGGGGACTG GGCTGTCGGG CTTATCTACA ACGTCTACCC GAACGTGAAG
ATGTGCCCCA TAGACGCCAT TACCCCTGAC TGCAACATCA TAGTGGCGCC GTTCCCGGGC
ACGCAGGGCA TCTACAACAT GGTCATCGAC GCCGTAGCCG TGCCCAAGGG TCCCGCCCAG
GACCTCGGAG TCCAGTTCGC CAAGTTCTTC GCCTCGAGGG ACGGTCAGAA GATATTCAAC
CCGCTTAAAG GCTCGATAGC GTGCTACGCG GACCTACCGA CCGACATATA CCCGACCTCG
ATACAGAAGT GGGAGGTAAG CCAGTACGCG GCTTCCAAGT CGCAGGTATT CAGCATCACG
CACGGTGCCC TGTTCTCCGA CGTCTGGAGC AAGCTTCTGA GCGGCGCAGT GCTCCTAGCG
CAGACAAAGC AGACCTCGAT GTGGTACTCG ACCGTCAGCG ACGCGATTAA GCTCGAGAGA
CAGCTCTGGG AGCAGAGCGG GCTCTTCCTG GGAACCCCCG AGAAGCCGTT CGCCGGCTAC
CTCCCGCCCT GGGCAAAGAA GTAG
 
Protein sequence
MSQQATKKKV SPLLIAGILV VIVVLVAAAF LLYKPPAAPT KPSKITFYTW WAGLERFAID 
ALIGNFTKNT GVAVEKTAVP GGAGVNAKYA IIALIMAGKP PEAFQVHCGP EMISYFMAAP
HGKDDFVDLT SVGQEIGLTA TPPGQVCMLS GRLYTLPVNL HRANLIFMNK QVLDKYGVKP
PTTIDELNAA CSKLKAAGVP CLVQAGADQF TVLHLWEQIF LAVAGPDKFI KFMYGTLDPN
DPSITQATQI FLGYVDTFPP DWMALDWTSA VDRVVKGMGA FHVDGDWAVG LIYNVYPNVK
MCPIDAITPD CNIIVAPFPG TQGIYNMVID AVAVPKGPAQ DLGVQFAKFF ASRDGQKIFN
PLKGSIACYA DLPTDIYPTS IQKWEVSQYA ASKSQVFSIT HGALFSDVWS KLLSGAVLLA
QTKQTSMWYS TVSDAIKLER QLWEQSGLFL GTPEKPFAGY LPPWAKK