Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1454 |
Symbol | |
ID | 4600580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1405159 |
End bp | 1407378 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639774229 |
Product | alpha amylase, catalytic region |
Protein accession | YP_920854 |
Protein GI | 119720359 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.137251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAGCA GAAAAATCGC CGCAATTTTG GCAATACTTC TCCTACTAGG AGTACTGATT GGGACCTCTC AAGCAACAGC GGGACCCAGC CCCTCCTACC CGACGGGGGA TCCCCAGACG TGGGTTATCT ACCAAATCGT CATCGATAGG TTCTACGACG GGAACACGTC GAACAACAAC CCTGCGAAAA GCCCGGGTCT CTACGACCCC ACCAAGACTA ACTGGAGGCT GTACTGGGGC GGCGATATCG ATGGTATAAT AGCGAAACTA CCGTACCTCT ATGAACTCGG AGTTACCGCT ATATGGATAT CGCCTGTCTT CGACAATATA GACGTCGCTA TAAACACTAG TAGCGGCCTG CAGGCAGGGT ATCACGGCTA TTGGCCTAAG GACTTTAAAG TAATAGAGGA ACACTTCGGT TCCTGGAGCA CTTTCTACAA ACTCATACAG GAAGCCAGGA AGTACAACAT CACGGTAATT ATCGATTTCG TTGTAAACCA CAGCAACCCA AGCGATGCCG GCGAATACGG AGCACTATAC GATAACGGTA CGTTCGTCAC CGACTATCCA ACGGATGCAA AATACGCTAC GGTTGACCCA ATAACTCGTA GTCTCTCGAA TATATACAAC CACAATGGGG GGATTACGAA CTGGAACGAC AGGTGGGAGG TTAGGTACAA GAACCTGTTC AACCTGGCTG ACTTTAACCA GCTGAACCCT TGGGTGGATA GATACCTCAA GGAATCTACG GCTTTGTACC TGAAGGCCGG TATCGGGGGG ATACGGCTGG ACGCCGTTAA ACACGTGGAG CCGGGCTGGC TGAAGACGTA CGCCGACTAC GTGTACGCGA TAAAAAACGT CTTCATGTTC GGAGAGTGGT ACCAAAGCTT TAACGACGAG ATGTACTGGG ACATGGTTAA GTTCGCGAAC GACAGCGGGA TCAGCGTTAT CAACATACCG CTTCAGCAGG TCCTAGTAGA CGTATTCGCC TACGACACAA AGACCATGTA CGACTTGGAC AACGCGGTCA AGAAGTATAC GAGTAACTTT ATGTGGCAAA ACAAGCTGGT TAACTTTATA GACAGCCACG ACGTGCCGAG GTTCCTCTCG CTGAGCAAGA GTATCACGAG GTTCCACCAG GTGCTAGCAT TCGTGATGAC CGCCCCCGGC ATCCCGGTGA TATACTACGG GGACGAGCAG TACCTACACT ACGACGCAAC GAACGAGTTC GGGCAGGTTG GGGGAGATCC TTACAACAGG CCTATGATGA CATCCTGGGA CACTACGACC ACGGCGTTCA AGTTGATAAA AGCCTTGGCA CAGTTAAGGC GCGCTAATAC CGCTCTAGCC TACGGCTTGG TAACCACGCG GTACGTGAGT AGCGACGTGT ACATCTTTGA GAGAAAGTTC TTCGGAAACG TGGTCCTCGT AGCCATAAAC CGGAACCTAA ACTCCCCGGT TGCCGTTTCC AATGTTTACA CTTCCCTCCC CGACGGGGTG TACAGCGACT ATCTAGGAGG GCTTATCAAC GGGACAAGCA TCAAAGTCGT AGGCGGTAAG TTCTCGGTAA CCTTGCCCCC CGGCTCCGTT TCCGTGTGGC AGTACAAAGC AGTACCGAGC GGTCCATGGG TAGGAGCCAT AGACCCGACG ATGGGCAGGG CTGGGAACGT AGTCGTGATC AGCGGGGAAG GGTTCGGTAG CCAGCCGGGA CAAGTCCTGA TAACGAACGG GCAGAGCACG TGGAGCGCTA CAGTTACGTA CTGGAGTGAT AAAAGCATAG AGTTCATAGT TCCCTCAGGG GTAACAACTC CTCTCAACGA CAACCACGTA ACGGTGATTG TTAAAAGAGC CGACGGGGCG ACGTCGAACG GGATAGCTTT CCAGTACCTC TCGGGTAGAC AAATCCCCGT TATATTCGAG GTGCAGAACA CCAAGGGAAC AACCCTGGAG ACAGTGCCTG GAGAGTTCCT GTGGCTAACC GGTAGCGTCC CAGAGCTAAG CAACTGGAGC CCCGCAACTA CGAGGGCTGT GGGACCCATG CTTTGCCCAG CGTGGCCTAA CTGGTTCGTC GTCGCCAGTG TCCCGGCGAA TACGTACATA GAGTTCAAGT TCTTGAAGGC TCCGCTAGGC GGTACCGGGG TCTGGGAGCC TGGAAGCAAC CATGCTTACA CTACTCCCTC GGACGGGATA GGAAGAGTGT CCGTCACTGC TAACGGGTAA
|
Protein sequence | MRSRKIAAIL AILLLLGVLI GTSQATAGPS PSYPTGDPQT WVIYQIVIDR FYDGNTSNNN PAKSPGLYDP TKTNWRLYWG GDIDGIIAKL PYLYELGVTA IWISPVFDNI DVAINTSSGL QAGYHGYWPK DFKVIEEHFG SWSTFYKLIQ EARKYNITVI IDFVVNHSNP SDAGEYGALY DNGTFVTDYP TDAKYATVDP ITRSLSNIYN HNGGITNWND RWEVRYKNLF NLADFNQLNP WVDRYLKEST ALYLKAGIGG IRLDAVKHVE PGWLKTYADY VYAIKNVFMF GEWYQSFNDE MYWDMVKFAN DSGISVINIP LQQVLVDVFA YDTKTMYDLD NAVKKYTSNF MWQNKLVNFI DSHDVPRFLS LSKSITRFHQ VLAFVMTAPG IPVIYYGDEQ YLHYDATNEF GQVGGDPYNR PMMTSWDTTT TAFKLIKALA QLRRANTALA YGLVTTRYVS SDVYIFERKF FGNVVLVAIN RNLNSPVAVS NVYTSLPDGV YSDYLGGLIN GTSIKVVGGK FSVTLPPGSV SVWQYKAVPS GPWVGAIDPT MGRAGNVVVI SGEGFGSQPG QVLITNGQST WSATVTYWSD KSIEFIVPSG VTTPLNDNHV TVIVKRADGA TSNGIAFQYL SGRQIPVIFE VQNTKGTTLE TVPGEFLWLT GSVPELSNWS PATTRAVGPM LCPAWPNWFV VASVPANTYI EFKFLKAPLG GTGVWEPGSN HAYTTPSDGI GRVSVTANG
|
| |