Gene Tpen_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1454 
Symbol 
ID4600580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1405159 
End bp1407378 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content53% 
IMG OID639774229 
Productalpha amylase, catalytic region 
Protein accessionYP_920854 
Protein GI119720359 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAGCA GAAAAATCGC CGCAATTTTG GCAATACTTC TCCTACTAGG AGTACTGATT 
GGGACCTCTC AAGCAACAGC GGGACCCAGC CCCTCCTACC CGACGGGGGA TCCCCAGACG
TGGGTTATCT ACCAAATCGT CATCGATAGG TTCTACGACG GGAACACGTC GAACAACAAC
CCTGCGAAAA GCCCGGGTCT CTACGACCCC ACCAAGACTA ACTGGAGGCT GTACTGGGGC
GGCGATATCG ATGGTATAAT AGCGAAACTA CCGTACCTCT ATGAACTCGG AGTTACCGCT
ATATGGATAT CGCCTGTCTT CGACAATATA GACGTCGCTA TAAACACTAG TAGCGGCCTG
CAGGCAGGGT ATCACGGCTA TTGGCCTAAG GACTTTAAAG TAATAGAGGA ACACTTCGGT
TCCTGGAGCA CTTTCTACAA ACTCATACAG GAAGCCAGGA AGTACAACAT CACGGTAATT
ATCGATTTCG TTGTAAACCA CAGCAACCCA AGCGATGCCG GCGAATACGG AGCACTATAC
GATAACGGTA CGTTCGTCAC CGACTATCCA ACGGATGCAA AATACGCTAC GGTTGACCCA
ATAACTCGTA GTCTCTCGAA TATATACAAC CACAATGGGG GGATTACGAA CTGGAACGAC
AGGTGGGAGG TTAGGTACAA GAACCTGTTC AACCTGGCTG ACTTTAACCA GCTGAACCCT
TGGGTGGATA GATACCTCAA GGAATCTACG GCTTTGTACC TGAAGGCCGG TATCGGGGGG
ATACGGCTGG ACGCCGTTAA ACACGTGGAG CCGGGCTGGC TGAAGACGTA CGCCGACTAC
GTGTACGCGA TAAAAAACGT CTTCATGTTC GGAGAGTGGT ACCAAAGCTT TAACGACGAG
ATGTACTGGG ACATGGTTAA GTTCGCGAAC GACAGCGGGA TCAGCGTTAT CAACATACCG
CTTCAGCAGG TCCTAGTAGA CGTATTCGCC TACGACACAA AGACCATGTA CGACTTGGAC
AACGCGGTCA AGAAGTATAC GAGTAACTTT ATGTGGCAAA ACAAGCTGGT TAACTTTATA
GACAGCCACG ACGTGCCGAG GTTCCTCTCG CTGAGCAAGA GTATCACGAG GTTCCACCAG
GTGCTAGCAT TCGTGATGAC CGCCCCCGGC ATCCCGGTGA TATACTACGG GGACGAGCAG
TACCTACACT ACGACGCAAC GAACGAGTTC GGGCAGGTTG GGGGAGATCC TTACAACAGG
CCTATGATGA CATCCTGGGA CACTACGACC ACGGCGTTCA AGTTGATAAA AGCCTTGGCA
CAGTTAAGGC GCGCTAATAC CGCTCTAGCC TACGGCTTGG TAACCACGCG GTACGTGAGT
AGCGACGTGT ACATCTTTGA GAGAAAGTTC TTCGGAAACG TGGTCCTCGT AGCCATAAAC
CGGAACCTAA ACTCCCCGGT TGCCGTTTCC AATGTTTACA CTTCCCTCCC CGACGGGGTG
TACAGCGACT ATCTAGGAGG GCTTATCAAC GGGACAAGCA TCAAAGTCGT AGGCGGTAAG
TTCTCGGTAA CCTTGCCCCC CGGCTCCGTT TCCGTGTGGC AGTACAAAGC AGTACCGAGC
GGTCCATGGG TAGGAGCCAT AGACCCGACG ATGGGCAGGG CTGGGAACGT AGTCGTGATC
AGCGGGGAAG GGTTCGGTAG CCAGCCGGGA CAAGTCCTGA TAACGAACGG GCAGAGCACG
TGGAGCGCTA CAGTTACGTA CTGGAGTGAT AAAAGCATAG AGTTCATAGT TCCCTCAGGG
GTAACAACTC CTCTCAACGA CAACCACGTA ACGGTGATTG TTAAAAGAGC CGACGGGGCG
ACGTCGAACG GGATAGCTTT CCAGTACCTC TCGGGTAGAC AAATCCCCGT TATATTCGAG
GTGCAGAACA CCAAGGGAAC AACCCTGGAG ACAGTGCCTG GAGAGTTCCT GTGGCTAACC
GGTAGCGTCC CAGAGCTAAG CAACTGGAGC CCCGCAACTA CGAGGGCTGT GGGACCCATG
CTTTGCCCAG CGTGGCCTAA CTGGTTCGTC GTCGCCAGTG TCCCGGCGAA TACGTACATA
GAGTTCAAGT TCTTGAAGGC TCCGCTAGGC GGTACCGGGG TCTGGGAGCC TGGAAGCAAC
CATGCTTACA CTACTCCCTC GGACGGGATA GGAAGAGTGT CCGTCACTGC TAACGGGTAA
 
Protein sequence
MRSRKIAAIL AILLLLGVLI GTSQATAGPS PSYPTGDPQT WVIYQIVIDR FYDGNTSNNN 
PAKSPGLYDP TKTNWRLYWG GDIDGIIAKL PYLYELGVTA IWISPVFDNI DVAINTSSGL
QAGYHGYWPK DFKVIEEHFG SWSTFYKLIQ EARKYNITVI IDFVVNHSNP SDAGEYGALY
DNGTFVTDYP TDAKYATVDP ITRSLSNIYN HNGGITNWND RWEVRYKNLF NLADFNQLNP
WVDRYLKEST ALYLKAGIGG IRLDAVKHVE PGWLKTYADY VYAIKNVFMF GEWYQSFNDE
MYWDMVKFAN DSGISVINIP LQQVLVDVFA YDTKTMYDLD NAVKKYTSNF MWQNKLVNFI
DSHDVPRFLS LSKSITRFHQ VLAFVMTAPG IPVIYYGDEQ YLHYDATNEF GQVGGDPYNR
PMMTSWDTTT TAFKLIKALA QLRRANTALA YGLVTTRYVS SDVYIFERKF FGNVVLVAIN
RNLNSPVAVS NVYTSLPDGV YSDYLGGLIN GTSIKVVGGK FSVTLPPGSV SVWQYKAVPS
GPWVGAIDPT MGRAGNVVVI SGEGFGSQPG QVLITNGQST WSATVTYWSD KSIEFIVPSG
VTTPLNDNHV TVIVKRADGA TSNGIAFQYL SGRQIPVIFE VQNTKGTTLE TVPGEFLWLT
GSVPELSNWS PATTRAVGPM LCPAWPNWFV VASVPANTYI EFKFLKAPLG GTGVWEPGSN
HAYTTPSDGI GRVSVTANG