Gene Tpen_0135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0135 
Symbol 
ID4602178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp110293 
End bp111849 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content57% 
IMG OID639772889 
ProductAlpha-amylase 
Protein accessionYP_919548 
Protein GI119719053 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGCACG TAGTATTCAT GTTCGAGGTT CACCAGCCCT ACAGGCTCAG CAGGAACATC 
CGAGGCACCC TCTCAGAGCT TACGGCGAGG AAAGGGAAGA TAACCGCCGA GGACCTGGAA
AGGGTTTACT TCGACGACGA GCTTAACAGA AGGATATTCG AGAGAGTCTC CAAGAGGTGC
TACTTACCGG CGAACAACGT CATAAGAGAT ATGGTGGAGT TCTTCAAGGA CTCGGGTAAA
CCGTTCAAGG TGTCATACAG CCTATCGGGA GTACTGTTGG AACAGGCGGA GAAATGGTAC
CCGGAGGTTA TAGAGAGCTT TAGGTCCCTC GCTAAGACTG GCATGGTCGA GTTCCTCGAT
CAAACGTACT ACCACAGCCT GGCCTTTCTC TTCGCTGAGG AGGAGCTTCT GGAGCAGGTA
AGGGAGCATA GGGAGGCTTT GCGGAAACAC TTGGGCGTCG AGCCCACCGC TGTGGAGAAC
ACCGAGTTCA CGTACAACAA CTACCTCGCG TGCCTCTTCG ACAAGCTGGG CTACAAGGTA
ATCCTAACCG AGGGTGTTGA GAGGGTTCTC GGGTGGAGGA GCCCCAACTA CCTCTACAAG
GCTAAAGGGT GCAACATACG CGTGCTGATG AGGAACTACC GCCTCTCCGA CGACATAGGT
TTCCGCTTCG GGGCTCGATG GTGGAGCGAG TACCCCCTCA CGGCGGACAA GTACTCGGCG
TGGCTCTCCG CGACCCCGGG CGAGGTCATC CTGGTAGCCA TCGACTACGA AACCTTCGGC
GAGCACTTCC CGGCGGAGAC GGGGATCTTC GAGTTCCTCA GGTGGCTTCC AGGAGAGATT
CTAAAGTGGG GGAACCTCGT GACGTCCACG CCGAGCGAGG TTGCCGAGAG GCTAACCCCC
AGGGACGAGG TGGACGTACC GGTGGAGTCG ACTATCTCCT GGGCGGATCT TGAGAGGGAC
CTCTCAGCGT GGATCGGGAA CTTCATGCAG AACAACGCTT TTACGAGGAT GAAGGACCTT
TGGTTGCAAA TAAAGGCTGT CCGGGACCGC GGACTCGAGA GGCTGTGGAA ACTGCTGTCG
ATAAGCGATC ACTACTACTA CATGTCCACC AAGGGTGGAG GACCCGGGGA CGTGCATTCC
TACTTCAGCC CCTACGGGAG CCCCTTTGAA GCCTACATGT TCTTCTCGGA GGCTCTCGGG
GACCTCGAGA CCAGGACCCT ACTGAGGCTT GAGAGCGACG AGACCGCGCG GTTAAGGTAT
GCCTGGATGA AGGATGTCCC ACGCGAGAAA GTCTTCGCTT TCTACAGGGC GCCGGGGCAG
CCGCTGGGCA GGTACGCTTG GAACATGTTC TCGTTTATAA ACGCCCTCAG AGAGGTCCCG
CTGGAGAGCG TCCTGTTCCA CCAGGAGAGA GGCGACTTTG CGAGCTGGGT CGAGTACATA
GTGGGGGACC ACGAACTCGC GGAAAGACTT AGGCGCGTCG GGACGGGAGG ATCGGAGGAG
GTTCGCTCCA GGATCTTAGA GGTAATCGAG CGGAGAAGGG CTGAGATCTT TGGCTGA
 
Protein sequence
MKHVVFMFEV HQPYRLSRNI RGTLSELTAR KGKITAEDLE RVYFDDELNR RIFERVSKRC 
YLPANNVIRD MVEFFKDSGK PFKVSYSLSG VLLEQAEKWY PEVIESFRSL AKTGMVEFLD
QTYYHSLAFL FAEEELLEQV REHREALRKH LGVEPTAVEN TEFTYNNYLA CLFDKLGYKV
ILTEGVERVL GWRSPNYLYK AKGCNIRVLM RNYRLSDDIG FRFGARWWSE YPLTADKYSA
WLSATPGEVI LVAIDYETFG EHFPAETGIF EFLRWLPGEI LKWGNLVTST PSEVAERLTP
RDEVDVPVES TISWADLERD LSAWIGNFMQ NNAFTRMKDL WLQIKAVRDR GLERLWKLLS
ISDHYYYMST KGGGPGDVHS YFSPYGSPFE AYMFFSEALG DLETRTLLRL ESDETARLRY
AWMKDVPREK VFAFYRAPGQ PLGRYAWNMF SFINALREVP LESVLFHQER GDFASWVEYI
VGDHELAERL RRVGTGGSEE VRSRILEVIE RRRAEIFG