Gene Tpen_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1110 
Symbol 
ID4601104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1047833 
End bp1049386 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content58% 
IMG OID639773887 
Productglycoside hydrolase family protein 
Protein accessionYP_920512 
Protein GI119720017 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCCGA AGAGTTTCCT CTGGGGAGTA TCCCTAGCAG GCTTCCAGTT CGAGATGGGG 
GACCCCGCGG GGGAAGCTTT GGACCCTAAC ACCGACTGGT ACGTGTGGGT ACACGACGAG
TACAACATAA GGGAGGGAAT AGTCAGCGGG GATCTGCCGG AGAAAGGGAT AGACTACTGG
CACCTTTTCA GGGAGGACCA CTCTCTGGCG AAAAGCCTGG GGCTAAACGC CTACAGGCTT
AACGTCGAGT GGAGCAGGGT GTTTCCGGAG CCGACGTTCA GCGTAGAGGT TGGGGTGGAA
GAGGAGGACG GCGTTAAGAC CGGTATAGAC ATCGACGACT CCGACTTAGA GAAGCTGGAC
AGCATTGCGA ACAAGAAGGC GGTGCAACAC TACAGGGAGG TCGTGGAGGA CCTCCGCGAG
AAGGGCTTCT ACGTCATCCT CAACTTGGTC CACTTCACGC TTCCAACCTG GATCCACGAC
CCTCTAACCG CGCGCGCCAC GAACGCGAAG AAGGGGCCAC TGGGCTACGC GGACCCCAGG
TTCCCGGTGG AGTTCGCGAA GTTCGCCGCC TACGTTGCGG CGAGCTTCGG GGATCTCGTA
GACGCGTGGT CAACGTTCAA CGAGCCGAGC GTGGTGACCG AGTCGGGCTT CCTGAAGAGG
AGGGGGAAGT TCCCGCCCGG CATATTCAAC TTCGACGCGT ACAAGCGGGC TATGATCAAC
ATCGCACAAG CACACCTACT GGCGTACATC GCTATCAAGA AGTTCGACAG GGTGAAAGCT
TATTCTGACT CCGCGGAGTC AGCGTCCGTC GGAATTATAC ACAACATGAT ACCGTTCCAC
CCCCTCGACC CCTCCAGGAA GCGCGACCGG GACGCATCTA TGGTAACACA CCACCTCCAT
AACTCCTGGA TCCCGAACTC CCTTGTAAAC GGGTGGATAG ACAGGGACTT CGACCTCAAA
CAGGAGCCCA GCGAAGTATT CGAGAAGTAC AAGTCGAGGC TTGACTGGAT GGGCATCAAC
TACTACTCGA GGTCCGTCGT CAAGGGTAAG GTCAACCTCC TCAGGCCTGT AATCCCGTTC
CCCGCGTTCC CCGTGCTCGT GAAGGGGTAC GGGTTTGAGT GTGCACCGAA CTCTCAGAGC
CTGGCAGGGA GACCTACCAC GGACTTCGGG TGGGAAGTAT ACCCCGAGGG CATAGTAGAG
GTTGTAAAAA TGGCAATGCA GTACAACGTT CCTCTACTCG TAACGGAGAA CGGGGTCGCA
GACGCGCGGG ACGAGCTGAG GCCGCACTTC CTAGCCCTCC ACCTAAAGCT CCTCGAGGAC
GCGTTGGAAA GCCGCGAGAT AAGCCTTAAA GGCTACCTTC ACTGGGCTCT GACGGACAAC
TACGAGTGGG CGGATGGCTT CAGGATGCGC TTCGGCCTAT TCGAGGTAGA CCTCTCCAGC
AAGAGAAGAG TGAAGCGCCC GAGCGCGGAT CTCTTTGCGA GGATAGTCTC GGAGGGGACT
GTCCCAGACG AGGCGGTCAG GAAGGCGAGG GAAAAGCTTT CCGTCAACCT TTAA
 
Protein sequence
MFPKSFLWGV SLAGFQFEMG DPAGEALDPN TDWYVWVHDE YNIREGIVSG DLPEKGIDYW 
HLFREDHSLA KSLGLNAYRL NVEWSRVFPE PTFSVEVGVE EEDGVKTGID IDDSDLEKLD
SIANKKAVQH YREVVEDLRE KGFYVILNLV HFTLPTWIHD PLTARATNAK KGPLGYADPR
FPVEFAKFAA YVAASFGDLV DAWSTFNEPS VVTESGFLKR RGKFPPGIFN FDAYKRAMIN
IAQAHLLAYI AIKKFDRVKA YSDSAESASV GIIHNMIPFH PLDPSRKRDR DASMVTHHLH
NSWIPNSLVN GWIDRDFDLK QEPSEVFEKY KSRLDWMGIN YYSRSVVKGK VNLLRPVIPF
PAFPVLVKGY GFECAPNSQS LAGRPTTDFG WEVYPEGIVE VVKMAMQYNV PLLVTENGVA
DARDELRPHF LALHLKLLED ALESREISLK GYLHWALTDN YEWADGFRMR FGLFEVDLSS
KRRVKRPSAD LFARIVSEGT VPDEAVRKAR EKLSVNL