Gene Tpen_1675 details


Gene Information

Locus tag: Tpen_1675
Symbol:
ID: 4600927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1620717
End bp: 1622258
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 45%
IMG OID: 639774448
Product: glycoside hydrolase family protein
Protein accession: YP_921073
Protein GI: 119720578
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
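
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1620717
end_bp = 1622258

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per amino acid, plus one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1542 513"
```

Both results match the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.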


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.256832
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACAA TGATGTTTCC GCAAAAATTT AGGTGGGGAG TCTCCGAGTC AGGTTTTCAG 
TTCGAGATGG GAGACGAGTA TAGGCGCTTC ATAGACACAA ACACCGATTG GTGGCACTGG
GTACGTGACC CTCACAACAT CTCTTCCAGA CTTGTCAGTG GAGATCTGCC AGAAGACGGT
ATAAACTACT TCGAACTATT TGGGAAGGAT CATGAATTGG CAAGGGAGCT GGGACTTAAC
ACTTACAGGC TGGGAATAGA GTGGAGCAGA ATCTTTCCAC ATCCGACCTG GTTTATAGAG
GTAGATTTTG AGAAGGATTC CTTGGGTTTC GTTAAGAGCG TAAGGATAGA CGAGGATACC
CTTAGGGCTC TGGACAGGTA TGCTTGTCGG AAGGCCGTGC AGATGTACAG AGAAATACTG
CTTGACTTAC GCAAAAGAGG ATTCAAAGTA ATCGTAAACC TTGTACACTT TACGTTACCT
TACTGGATTC ACGACCCTAT AAGGGCGAAG TCTTCTGAGC TTTCTGAGGG ACCTTTAGGC
CTCCTCGAGG AAAGCTTCCC TATTGAGATG GCGAAGTACG CAGCTTACGT TGCATGGAAG
TTTGGCGACT TAGTAGATAT GTGGTCCACG TTTAATGAAC CAGTAGTACC TATAGAGCTA
GGCTATCTAG GCACTTACAC AGGTTTTCCT CCAGGTGTCA ATAAGCCCCA GGCAGTTCCT
AAAGCATTGG TAAACACTGC TATCGCTCAT GCCTTAGCTT ACGATATGAT AAAGAAGTTT
GACAATGTAA AGGCTGATCC CGACTCTAAT TCTCCGGCAG AGGTGGGTCT AATCTACAAT
ATAATTCCAG CATACTCTCC CGAGGGAACA AAATCAGAAA AAGCCGTTGA GCATTATTCA
TATTTCCATA ACGAGTTGCT ACTAGAGGCC GTAAAGAACG GCAGATTAGA TGTCGCGCTT
GACGGCAAAA ATATCCTTAA ACCCGCTGCT CTAGGCGGGA AGCTTGACTG GTTAGGTGTT
AACTACTACA CTAGGATTGT TGTCAAAGAG TCTTCTCGTC GCTTTAACGG ACATCCAGTA
TTAGACTTCG AAGCAGTGGC TGGTTACGGA TACGCTTGTG TTCCGTTCGG ACTCTCGAAG
ATTGGAAGAG CTTGTGATGG AATGGGGTGG GAGTTCTATC CAGAAGGGCT TATTGATGCA
TTGAGGATTG GGTCGACCTA CGCGAGTAAG CTTTTAGTCA CGGAGAACGG CACCTCCGAT
CCTAGGGACG TAATACGTCC CAGTTATCTC GTAAACCATC TGTATGCATT ATTACTAGCC
ATAGAGGAAG GAATAAATGT AGAAGGTTAT TTGCATTGGG CGTTAACGGA TAACTACGAG
TGGGCTCATG GTTTTCGGCA ACGTTTCGGT TTGTTCGAAG TAGACCTCAT CACGAAGAGT
AGAATTCCAA GGCATTCTTC AAGGATTTAT AAGCATATAA TCCAGCAAGG TTTTATACCA
AGTGAGTATA AGAAAGACAT CGTCGAGTTT AGGGGGATAT AG
 
Protein sequence
MSTMMFPQKF RWGVSESGFQ FEMGDEYRRF IDTNTDWWHW VRDPHNISSR LVSGDLPEDG 
INYFELFGKD HELARELGLN TYRLGIEWSR IFPHPTWFIE VDFEKDSLGF VKSVRIDEDT
LRALDRYACR KAVQMYREIL LDLRKRGFKV IVNLVHFTLP YWIHDPIRAK SSELSEGPLG
LLEESFPIEM AKYAAYVAWK FGDLVDMWST FNEPVVPIEL GYLGTYTGFP PGVNKPQAVP
KALVNTAIAH ALAYDMIKKF DNVKADPDSN SPAEVGLIYN IIPAYSPEGT KSEKAVEHYS
YFHNELLLEA VKNGRLDVAL DGKNILKPAA LGGKLDWLGV NYYTRIVVKE SSRRFNGHPV
LDFEAVAGYG YACVPFGLSK IGRACDGMGW EFYPEGLIDA LRIGSTYASK LLVTENGTSD
PRDVIRPSYL VNHLYALLLA IEEGINVEGY LHWALTDNYE WAHGFRQRFG LFEVDLITKS
RIPRHSSRIY KHIIQQGFIP SEYKKDIVEF RGI
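
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The Python sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first and last lines of the gene sequence, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a spaced, multi-line sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence above, used only as a small demo;
# paste the full listing in their place to check the whole gene.
demo = """
ATGAGTACAA TGATGTTTCC GCAAAAATTT AGGTGGGGAG TCTCCGAGTC AGGTTTTCAG
AGTGAGTATA AGAAAGACAT CGTCGAGTTT AGGGGGATAT AG
"""
seq = clean_sequence(demo)

# Report length, first codon, last codon, and GC percentage of the demo text.
print(len(seq), seq[:3], seq[-3:], round(gc_percent(seq), 1))
```

Running the same helpers on the complete gene listing should give a length equal to the Gene Length field, with the sequence starting at an ATG start codon and ending at a TAG stop codon.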