Gene Tpen_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1511 
Symbol 
ID4601107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1459316 
End bp1460698 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content61% 
IMG OID639774286 
ProductAlpha-galactosidase 
Protein accessionYP_920911 
Protein GI119720416 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.408355 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCGCG TGCCGAAAAT AAAAATAAGC GTTATAGGCG CAGGTAGCGT TGCGTGGAGC 
TCTAAGCTCG TACACGACCT CCTGCACACG CCCTCCCTGT ACGGTAGCGA CGTGTACTTC
ATGGATATAA ACGAGGAGAG GCTACGCATC CTCAGGGGTC TCGCGGAGAA GTACAAGTCG
GAGGTCGGGG CGGAGTACAA CTTCTTCTAC ACGACGGACA GGAAGGAGGC TGTGAGGGAC
TCGGACGTCG TGATAAACAC GGCGATGTAC GGTGGGCACA GGTACTACGA GCTTATGCGC
GAGGTTAGCG AGAAGCACGG CTACTACCGG GGCGTGAACA GCGTTGAGTG GAACATGGTG
AGCGACTACC ACACGATATG GGGCTACTAC CAGTGGAAGC TCGCGATGGA CATAGCGAGG
GACGTCGAGG AGCTGGCGCC GGGCGCCTGG CTGATCCAGA TGGCGAACCC CGTCTTCGAG
CTTACAACAC TGATTTCGCG GGAGACGAGG GTCAAGGTCG TAGGGCTGTG CCACGGGCAC
CTGGGCTACA AGGAGATAGC GCAGACGATA GGGCTCGACC CGGCGAGGGT GGGCTTCGAG
GCTATAGGCT TCAACCACGT GATATGGCTG ACGAAGTTCA CCTACGACGG GGAGGACGCG
TACCCGCTCA TAGACAGGTG GGTAGAGGAG AAGGCGGAGC AGTACTGGTC GCAGTGGAGG
ATGAGGCAGG TAAACCCCTT CGACATACAG ATGTCCCCTG CGGCTGTCGA CATGTACAGG
CGCTACGGGC TCTTCCCGGT AGGCGACACC GTCAGAGGAG GTACCTGGAA GTACCACTCG
AGCCTCAAGA CGAAGCAGTA CTGGTACGGC CCTACGGGCG GCCCCGACAG CGAGATAGGC
TGGGCTCTCT ACACCGCGCA CCAGGAGTAC TGGCTGGCAA CGCTGGCCCA GGCGGCCTCC
GACCCGAGGA TACCGGTCTC CTCCCTCTTC CCGCAGACCA GGACCGAGGA GAGTGTCGTC
CCGCTAATCG AGAGCCTCAT GCTCGACAAG CCGGGCGAGT ACCAGGTCAA CGTGCTCAAC
GGCAACGCGA TAGAAGGCAT ACCCAGCAAC GTGGCAGTCG AGGTGCCGGC CAGGGTGGAC
GCCCGCGGTA TACACCCGAA GACCGGGCTG AGGCTACCGA GGAAGATACT CTCGCTCGTC
ATGCAGCCCA GGCTGCTCCG CGCGGAGATG GCGATAGCCG CCTTCCTGGA GGGCGGGAGG
CAGTTCCTTA TAGACTGGCT CATGCTGGAC CCGAGGACGA GGAGCGAGGA GCAGGCAGAG
AAGGTTTGGG AGGAGATACT ATCGCTACCC GGGAACGAGG AGATGAAGAG GCACTACAGC
TAG
 
Protein sequence
MSRVPKIKIS VIGAGSVAWS SKLVHDLLHT PSLYGSDVYF MDINEERLRI LRGLAEKYKS 
EVGAEYNFFY TTDRKEAVRD SDVVINTAMY GGHRYYELMR EVSEKHGYYR GVNSVEWNMV
SDYHTIWGYY QWKLAMDIAR DVEELAPGAW LIQMANPVFE LTTLISRETR VKVVGLCHGH
LGYKEIAQTI GLDPARVGFE AIGFNHVIWL TKFTYDGEDA YPLIDRWVEE KAEQYWSQWR
MRQVNPFDIQ MSPAAVDMYR RYGLFPVGDT VRGGTWKYHS SLKTKQYWYG PTGGPDSEIG
WALYTAHQEY WLATLAQAAS DPRIPVSSLF PQTRTEESVV PLIESLMLDK PGEYQVNVLN
GNAIEGIPSN VAVEVPARVD ARGIHPKTGL RLPRKILSLV MQPRLLRAEM AIAAFLEGGR
QFLIDWLMLD PRTRSEEQAE KVWEEILSLP GNEEMKRHYS