Gene Tpen_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1458 
Symbol 
ID4600584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1410335 
End bp1412269 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content62% 
IMG OID639774233 
Productalpha amylase, catalytic region 
Protein accessionYP_920858 
Protein GI119720363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.628846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTACAGGG TTCTAGGCTT CAGGGACGAC GTCTACCTCG GCAGGGTTGT GAAGGCGGAG 
TTCAGCGCCC CGAGGGAGGG GGAGTACGCC TACCTGCTCG GCAACTTCAA CGCGTTTAAC
GAGGGAAGCT TCAGGATGCG GGGCGCGGGC GACAGGTGGG TCGTCGAGGT AGAGCTACCC
GAGGGGGTCT GGTACTACCT CTTCTCGCTG GGGGGTAGGC GCGCGGTCGA CCCGGAGAAC
CCCGAGACCA CCGTCTACTC GAGGAGGGCT TACAAGTTTG AGGAGAGGGT TAGCGTGGCT
AAGCTCCTTG GCTTCGACCC GGCGTCCTGC AACGGCTTCT GCGAGGAGGC ATTGTACCAC
TACCCGAGCT TGACCTACGT TTACCCCTTC GGGGGCGTGC TCTTCGTTAG GCTCAGGGCG
CTCAGGGGGA GCCTCCAGAA GGCTTTCTTG GTTGTCGACG GCAGGAGGCT GGAGATGAGG
CTGAAGGCCC GCGACGAGGT ATTCGACTAC TACGAGGCGA GCCTCGAGGC GGGCGGGGAG
GTATCCTACT ACTTTGAGGT TCTCGGGGGA GGGAGGCTCC ACCGCTACGG GGAGTTCTCC
GTAGACGTCA AGTCCCTGGA AAGCCTTATC CGGGTGCCGG AGTGGGTGTA CGGAAGCGTG
TTCTACCAGA TTATGCCGGA CAGGTTCGCG GAGGGAGGCC TCGAAGAGAT AGCCGAAAGG
CTAAACCACG TCTCGGGGCT GGGGGCGAAC GCGCTGTACC TTACCCCCAT CTTCGAGTCC
ACGACTTACC ACGGCTACGA CGTCGTGGAC TACTACCGCG TAGCCGGCAG GCTCGGCGGG
GACGAGGCGT TCGGGAGGCT CCTCGCGGAG CTGAAGAAGA GGGGGATGAG GGTAGTACTG
GACGGAGTCT TCCACCACAC GAGCTTCTTT CACCCGTACT TCCAGGACCT CGTGGAGAAG
GGGGAGGAGT CGCGGTACAA GGGCTTCTAC AGGGTGCTGG GCTTCCCCGT CGTCCCGCGG
GAGTTCCTCG AAGCCCTGAG GTCCGGGGCG CCGCGGCACG AGCTGAAGAA GTACCCGCGG
AGGTACGAGA GCTTCTTCGA CGTATGGCTG ATGCCCCGCC TGAACCACGA CAACCCGGAG
GTCAGGAGCT TCATAACCGG CGTCGGCAGG TACTGGGTCT CCAGGGGGGT AGACGGCTGG
AGGCTAGACG TGGCGCACGG CGTGCCCCCC GAGCTTTGGA GGGAGTTCAG GGAGACCCTC
CCAGGGGACG TCTACCTCTT CGGCGAGGTC ATGGACGACG CGCGCATATG GCTCTTCGAC
AAGTTCCACG GCGCTATGAA CTACCTGCTC TACGACGCGG TTCTCAGGTT CTTCGCCTAC
CGGGAGATAA CCGCCGAGGA GTTCCTCAAC AGGCTCGAGC TTCTAAGCGT GTACTACGGC
CCCGGGGAGT ACGCGATGTA CAACTTCCTC GACAACCACG ACGTGGACAG GCTCCTATCC
CTCGTGGGCG ACAGGGACAA GTACCTCTGC GCCCTGGTCT TCCTCTTCAC GTACAAGGGG
GTCCCCTCCA TATACTACGG CGACGAGGTA GGCCTGGAGA ACACGGACTC GCCGTTCATG
GAGCGTTCCA GGGCCCCCAT GCGCTGGGAC GAGTCAACCT GGGACAAAGC GATACTGGAG
GCTACGAGGG CGCTGGCGTC GCTTAGGAGG AGGAGCGCGG CGCTACAGAG AGGGGCATTC
GAGCCGGTGA GATTCGAGGG AGGGCTACTC GTGTACAGGA GGAGACTCGG CGACGAAAGC
ATCCTCGTCG CCATAAACTA CTCCGAAAGC GAAGCCGTAC TCGAAGAGCC CGCGCAGAGC
GTGCTCTTCC GCTCGGGAAG CGTCAAAGAA AAGCTTCTAG GACCGTTCTC CAGCGTAGTC
GCCGGAGACC GCTAA
 
Protein sequence
MYRVLGFRDD VYLGRVVKAE FSAPREGEYA YLLGNFNAFN EGSFRMRGAG DRWVVEVELP 
EGVWYYLFSL GGRRAVDPEN PETTVYSRRA YKFEERVSVA KLLGFDPASC NGFCEEALYH
YPSLTYVYPF GGVLFVRLRA LRGSLQKAFL VVDGRRLEMR LKARDEVFDY YEASLEAGGE
VSYYFEVLGG GRLHRYGEFS VDVKSLESLI RVPEWVYGSV FYQIMPDRFA EGGLEEIAER
LNHVSGLGAN ALYLTPIFES TTYHGYDVVD YYRVAGRLGG DEAFGRLLAE LKKRGMRVVL
DGVFHHTSFF HPYFQDLVEK GEESRYKGFY RVLGFPVVPR EFLEALRSGA PRHELKKYPR
RYESFFDVWL MPRLNHDNPE VRSFITGVGR YWVSRGVDGW RLDVAHGVPP ELWREFRETL
PGDVYLFGEV MDDARIWLFD KFHGAMNYLL YDAVLRFFAY REITAEEFLN RLELLSVYYG
PGEYAMYNFL DNHDVDRLLS LVGDRDKYLC ALVFLFTYKG VPSIYYGDEV GLENTDSPFM
ERSRAPMRWD ESTWDKAILE ATRALASLRR RSAALQRGAF EPVRFEGGLL VYRRRLGDES
ILVAINYSES EAVLEEPAQS VLFRSGSVKE KLLGPFSSVV AGDR