Gene Information |
Locus tag | Tpen_1458 |
Symbol | |
ID | 4600584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1410335 |
End bp | 1412269 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774233 |
Product | alpha amylase, catalytic region |
Protein accession | YP_920858 |
Protein GI | 119720363 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
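The coordinate and length fields above are mutually consistent, and the relationship can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1410335
END_BP = 1412269


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1935 644
```

Under these assumptions the result matches the listed Gene Length (1935 bp) and Protein Length (644 aa). The strand field does not affect the length calculation; it only determines which strand the sequence is read from.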
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.628846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTACAGGG TTCTAGGCTT CAGGGACGAC GTCTACCTCG GCAGGGTTGT GAAGGCGGAG TTCAGCGCCC CGAGGGAGGG GGAGTACGCC TACCTGCTCG GCAACTTCAA CGCGTTTAAC GAGGGAAGCT TCAGGATGCG GGGCGCGGGC GACAGGTGGG TCGTCGAGGT AGAGCTACCC GAGGGGGTCT GGTACTACCT CTTCTCGCTG GGGGGTAGGC GCGCGGTCGA CCCGGAGAAC CCCGAGACCA CCGTCTACTC GAGGAGGGCT TACAAGTTTG AGGAGAGGGT TAGCGTGGCT AAGCTCCTTG GCTTCGACCC GGCGTCCTGC AACGGCTTCT GCGAGGAGGC ATTGTACCAC TACCCGAGCT TGACCTACGT TTACCCCTTC GGGGGCGTGC TCTTCGTTAG GCTCAGGGCG CTCAGGGGGA GCCTCCAGAA GGCTTTCTTG GTTGTCGACG GCAGGAGGCT GGAGATGAGG CTGAAGGCCC GCGACGAGGT ATTCGACTAC TACGAGGCGA GCCTCGAGGC GGGCGGGGAG GTATCCTACT ACTTTGAGGT TCTCGGGGGA GGGAGGCTCC ACCGCTACGG GGAGTTCTCC GTAGACGTCA AGTCCCTGGA AAGCCTTATC CGGGTGCCGG AGTGGGTGTA CGGAAGCGTG TTCTACCAGA TTATGCCGGA CAGGTTCGCG GAGGGAGGCC TCGAAGAGAT AGCCGAAAGG CTAAACCACG TCTCGGGGCT GGGGGCGAAC GCGCTGTACC TTACCCCCAT CTTCGAGTCC ACGACTTACC ACGGCTACGA CGTCGTGGAC TACTACCGCG TAGCCGGCAG GCTCGGCGGG GACGAGGCGT TCGGGAGGCT CCTCGCGGAG CTGAAGAAGA GGGGGATGAG GGTAGTACTG GACGGAGTCT TCCACCACAC GAGCTTCTTT CACCCGTACT TCCAGGACCT CGTGGAGAAG GGGGAGGAGT CGCGGTACAA GGGCTTCTAC AGGGTGCTGG GCTTCCCCGT CGTCCCGCGG GAGTTCCTCG AAGCCCTGAG GTCCGGGGCG CCGCGGCACG AGCTGAAGAA GTACCCGCGG AGGTACGAGA GCTTCTTCGA CGTATGGCTG ATGCCCCGCC TGAACCACGA CAACCCGGAG GTCAGGAGCT TCATAACCGG CGTCGGCAGG TACTGGGTCT CCAGGGGGGT AGACGGCTGG AGGCTAGACG TGGCGCACGG CGTGCCCCCC GAGCTTTGGA GGGAGTTCAG GGAGACCCTC CCAGGGGACG TCTACCTCTT CGGCGAGGTC ATGGACGACG CGCGCATATG GCTCTTCGAC AAGTTCCACG GCGCTATGAA CTACCTGCTC TACGACGCGG TTCTCAGGTT CTTCGCCTAC CGGGAGATAA CCGCCGAGGA GTTCCTCAAC AGGCTCGAGC TTCTAAGCGT GTACTACGGC CCCGGGGAGT ACGCGATGTA CAACTTCCTC GACAACCACG ACGTGGACAG GCTCCTATCC CTCGTGGGCG ACAGGGACAA GTACCTCTGC GCCCTGGTCT TCCTCTTCAC GTACAAGGGG GTCCCCTCCA TATACTACGG CGACGAGGTA GGCCTGGAGA ACACGGACTC GCCGTTCATG GAGCGTTCCA GGGCCCCCAT GCGCTGGGAC GAGTCAACCT GGGACAAAGC GATACTGGAG GCTACGAGGG CGCTGGCGTC GCTTAGGAGG AGGAGCGCGG CGCTACAGAG AGGGGCATTC GAGCCGGTGA GATTCGAGGG AGGGCTACTC GTGTACAGGA GGAGACTCGG CGACGAAAGC 
ATCCTCGTCG CCATAAACTA CTCCGAAAGC GAAGCCGTAC TCGAAGAGCC CGCGCAGAGC GTGCTCTTCC GCTCGGGAAG CGTCAAAGAA AAGCTTCTAG GACCGTTCTC CAGCGTAGTC GCCGGAGACC GCTAA
|
Protein sequence | MYRVLGFRDD VYLGRVVKAE FSAPREGEYA YLLGNFNAFN EGSFRMRGAG DRWVVEVELP EGVWYYLFSL GGRRAVDPEN PETTVYSRRA YKFEERVSVA KLLGFDPASC NGFCEEALYH YPSLTYVYPF GGVLFVRLRA LRGSLQKAFL VVDGRRLEMR LKARDEVFDY YEASLEAGGE VSYYFEVLGG GRLHRYGEFS VDVKSLESLI RVPEWVYGSV FYQIMPDRFA EGGLEEIAER LNHVSGLGAN ALYLTPIFES TTYHGYDVVD YYRVAGRLGG DEAFGRLLAE LKKRGMRVVL DGVFHHTSFF HPYFQDLVEK GEESRYKGFY RVLGFPVVPR EFLEALRSGA PRHELKKYPR RYESFFDVWL MPRLNHDNPE VRSFITGVGR YWVSRGVDGW RLDVAHGVPP ELWREFRETL PGDVYLFGEV MDDARIWLFD KFHGAMNYLL YDAVLRFFAY REITAEEFLN RLELLSVYYG PGEYAMYNFL DNHDVDRLLS LVGDRDKYLC ALVFLFTYKG VPSIYYGDEV GLENTDSPFM ERSRAPMRWD ESTWDKAILE ATRALASLRR RSAALQRGAF EPVRFEGGLL VYRRRLGDES ILVAINYSES EAVLEEPAQS VLFRSGSVKE KLLGPFSSVV AGDR
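The link between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative sketch, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with GTG even though the gene lies on the minus strand), and it relies on translation table 11 using the standard amino acid assignments with alternative start codons such as GTG read as methionine. The names `translate_cds` and `is_cds_start` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Translation table 11 shares these assignments and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if i == 0 and is_cds_start and codon in START_CODONS_TABLE_11:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
head = translate_cds("GTGTACAGGG TTCTAGGCTT CAGGGACGAC")
tail = translate_cds("GCCGGAGACC GCTAA", is_cds_start=False)
print(head, tail)  # MYRVLGFRDD AGDR
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above, including the GTG start codon read as M and the terminal TAA stop codon.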