Gene Tpen_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1297 
Symbol 
ID4600651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1239362 
End bp1241035 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content67% 
IMG OID639774073 
Productglycoside hydrolase family protein 
Protein accessionYP_920698 
Protein GI119720203 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00968279 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGGGG AGCCGGTCGT TGTGCCCGAG CCGGCGAGGC TCGAGTTCGG GGGGCGCTGG 
TTCAGCTTCG ACGGCTTCTC GAACCTCGAC CCCTTCGTAG CCGAGGAGTT CCGCCTGCCC
AGGGGGGAGT GGGAGGTTCG GAGGGTTGAG GGGGAGGGGA CGGGCCTCGA GGTCAAGGAA
GGCTTCGTCG AGGTGTGGGG GGATCCGAGG GTCTACACGG CGACGTTGAT CCAGCTGGTC
ATTCAGGGGG GTTACCGCGC CATCCCGGAG GTCAGGGTCG AGGAGAGGTT GCGCTTCGAG
TTCCGGGGCT TCCACCTAGA CGTTGCGCGC GGCGGCGTGG CGACTGTGGA GGAGCTGAAG
AGGCTTCTGC GCTGGCTCTT CCTCCTGAAG TACAACTACC TGGCCGTCTA CGTCGAGGAC
CTCTTCCCCT GGGACCGCTA CCCGGACATC GGGGCGAGGC GGGGCAGGTA CACGGGCAAG
GAGTGGAGGG AGGTTGTGGA GTACGGGGGT AGGCTCGGGG TGGAGGTCTT CCCATCGCTC
GAGCTGGCCG GGCACATGGA GAACATCCTG TCCCTGCCGG GGTACCGGAG GTTCAGCGAG
TGGCATAGGC CGGAGGAGGG GTGCCTGGAC GTGGGCGACC CCGAGGCTAG GAGGTTCGCG
GAGGAGTTGC TGGAGGAGGC GCTCGAGAAG ACGAAGTCGA GGTACATCCA CATAGGCGGC
GACGAGACCT GGGCGATGGG GAGGGGGAGG AGCCTGGACA GGACCCTGAG GTTCGAGTGG
CCGCGGCTCT ACGCCGAGCA CCACTCCAGG CTGCTTCGGC TGGCCAGGGA GCGCGGCAAG
ACCCCGCTGA TCTGGGGCGA CATGCTCGCC GGCATGTACC TGCGCGAGAC CGAGAGGGAT
CTGTGGAGGC CGATCCTCGA GAACCCGGCG TGGAGGGAGG CCGTGGTGGC GAACTGGGAC
TACTCGCCCG GCACCGTCGA GTACTTCAAG CAGAAGATCC GGCTCTTCAA GGAGAGAGGC
TACGAGCAAA TCGTCTGCCC GGGCCTCTGG AACTGGGACA GGTACTACCC CGACTTCGAC
GCCGCCCTGG CGAACGTGAA GAGCTTCCTG CAGGCGGCGC GGGAGGAGGG AGTCAAGGGC
TTCATGGTGA CCGCGTGGGG CGACGACGGG GAGGAGTGCC TCTACTCCTT CCTGTACCCG
CTCATCCTGG CCTCGATGGA GTACGCCGAG GGCAACGGCA GGTGGGAGGA GAAGTGGCTC
GCGCTCAGCG GGGAGCCGCG CGAGGTGCTG GAGGTCAGGA AGGCGCTCGG CAAGGGCGAG
GTCGCCAACT ACGTCAAAAG GGTGCTCTTC TCCCCCACGG ACGAGGTGAA GGGCCTCCCG
GTCTTCGACG AGTGGAGGAA GGCGCTCGAG CTGGCGGAGA GAGTCAGGCT CCCACCAGAC
CTCGAGTTCG TGAAGCGTTG CCTGGAGGTG GGCCTGAGGA AGGTGGAAGG AAAGGCCACG
GCAGCCGACC TCCTGGGGCT CGCCAGCCTC TACGCGGACC TCTGGCTAAG GGAACGCAAG
CCAGCGAACC TCGCCAGGGT CTACGCCCGC TTCTACAGCG CCGCCGCCTT AGGCCGCGCG
CCCAAAGCCC GGCGCAGAGC AAAGCCGGCC TCGTCACCGC ATAAAAGGGT TTAG
 
Protein sequence
MGGEPVVVPE PARLEFGGRW FSFDGFSNLD PFVAEEFRLP RGEWEVRRVE GEGTGLEVKE 
GFVEVWGDPR VYTATLIQLV IQGGYRAIPE VRVEERLRFE FRGFHLDVAR GGVATVEELK
RLLRWLFLLK YNYLAVYVED LFPWDRYPDI GARRGRYTGK EWREVVEYGG RLGVEVFPSL
ELAGHMENIL SLPGYRRFSE WHRPEEGCLD VGDPEARRFA EELLEEALEK TKSRYIHIGG
DETWAMGRGR SLDRTLRFEW PRLYAEHHSR LLRLARERGK TPLIWGDMLA GMYLRETERD
LWRPILENPA WREAVVANWD YSPGTVEYFK QKIRLFKERG YEQIVCPGLW NWDRYYPDFD
AALANVKSFL QAAREEGVKG FMVTAWGDDG EECLYSFLYP LILASMEYAE GNGRWEEKWL
ALSGEPREVL EVRKALGKGE VANYVKRVLF SPTDEVKGLP VFDEWRKALE LAERVRLPPD
LEFVKRCLEV GLRKVEGKAT AADLLGLASL YADLWLRERK PANLARVYAR FYSAAALGRA
PKARRRAKPA SSPHKRV