Gene Information |
Locus tag | Tpen_1297 |
Symbol | |
ID | 4600651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1239362 |
End bp | 1241035 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774073 |
Product | glycoside hydrolase family protein |
Protein accession | YP_920698 |
Protein GI | 119720203 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
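The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1239362
END_BP = 1241035


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
aa_len = protein_length(gene_len)
print(gene_len, aa_len)  # 1674 557
```

Both results match the reported Gene Length (1674 bp) and Protein Length (557 aa).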
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00968279 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGGGG AGCCGGTCGT TGTGCCCGAG CCGGCGAGGC TCGAGTTCGG GGGGCGCTGG TTCAGCTTCG ACGGCTTCTC GAACCTCGAC CCCTTCGTAG CCGAGGAGTT CCGCCTGCCC AGGGGGGAGT GGGAGGTTCG GAGGGTTGAG GGGGAGGGGA CGGGCCTCGA GGTCAAGGAA GGCTTCGTCG AGGTGTGGGG GGATCCGAGG GTCTACACGG CGACGTTGAT CCAGCTGGTC ATTCAGGGGG GTTACCGCGC CATCCCGGAG GTCAGGGTCG AGGAGAGGTT GCGCTTCGAG TTCCGGGGCT TCCACCTAGA CGTTGCGCGC GGCGGCGTGG CGACTGTGGA GGAGCTGAAG AGGCTTCTGC GCTGGCTCTT CCTCCTGAAG TACAACTACC TGGCCGTCTA CGTCGAGGAC CTCTTCCCCT GGGACCGCTA CCCGGACATC GGGGCGAGGC GGGGCAGGTA CACGGGCAAG GAGTGGAGGG AGGTTGTGGA GTACGGGGGT AGGCTCGGGG TGGAGGTCTT CCCATCGCTC GAGCTGGCCG GGCACATGGA GAACATCCTG TCCCTGCCGG GGTACCGGAG GTTCAGCGAG TGGCATAGGC CGGAGGAGGG GTGCCTGGAC GTGGGCGACC CCGAGGCTAG GAGGTTCGCG GAGGAGTTGC TGGAGGAGGC GCTCGAGAAG ACGAAGTCGA GGTACATCCA CATAGGCGGC GACGAGACCT GGGCGATGGG GAGGGGGAGG AGCCTGGACA GGACCCTGAG GTTCGAGTGG CCGCGGCTCT ACGCCGAGCA CCACTCCAGG CTGCTTCGGC TGGCCAGGGA GCGCGGCAAG ACCCCGCTGA TCTGGGGCGA CATGCTCGCC GGCATGTACC TGCGCGAGAC CGAGAGGGAT CTGTGGAGGC CGATCCTCGA GAACCCGGCG TGGAGGGAGG CCGTGGTGGC GAACTGGGAC TACTCGCCCG GCACCGTCGA GTACTTCAAG CAGAAGATCC GGCTCTTCAA GGAGAGAGGC TACGAGCAAA TCGTCTGCCC GGGCCTCTGG AACTGGGACA GGTACTACCC CGACTTCGAC GCCGCCCTGG CGAACGTGAA GAGCTTCCTG CAGGCGGCGC GGGAGGAGGG AGTCAAGGGC TTCATGGTGA CCGCGTGGGG CGACGACGGG GAGGAGTGCC TCTACTCCTT CCTGTACCCG CTCATCCTGG CCTCGATGGA GTACGCCGAG GGCAACGGCA GGTGGGAGGA GAAGTGGCTC GCGCTCAGCG GGGAGCCGCG CGAGGTGCTG GAGGTCAGGA AGGCGCTCGG CAAGGGCGAG GTCGCCAACT ACGTCAAAAG GGTGCTCTTC TCCCCCACGG ACGAGGTGAA GGGCCTCCCG GTCTTCGACG AGTGGAGGAA GGCGCTCGAG CTGGCGGAGA GAGTCAGGCT CCCACCAGAC CTCGAGTTCG TGAAGCGTTG CCTGGAGGTG GGCCTGAGGA AGGTGGAAGG AAAGGCCACG GCAGCCGACC TCCTGGGGCT CGCCAGCCTC TACGCGGACC TCTGGCTAAG GGAACGCAAG CCAGCGAACC TCGCCAGGGT CTACGCCCGC TTCTACAGCG CCGCCGCCTT AGGCCGCGCG CCCAAAGCCC GGCGCAGAGC AAAGCCGGCC TCGTCACCGC ATAAAAGGGT TTAG
|
Protein sequence | MGGEPVVVPE PARLEFGGRW FSFDGFSNLD PFVAEEFRLP RGEWEVRRVE GEGTGLEVKE GFVEVWGDPR VYTATLIQLV IQGGYRAIPE VRVEERLRFE FRGFHLDVAR GGVATVEELK RLLRWLFLLK YNYLAVYVED LFPWDRYPDI GARRGRYTGK EWREVVEYGG RLGVEVFPSL ELAGHMENIL SLPGYRRFSE WHRPEEGCLD VGDPEARRFA EELLEEALEK TKSRYIHIGG DETWAMGRGR SLDRTLRFEW PRLYAEHHSR LLRLARERGK TPLIWGDMLA GMYLRETERD LWRPILENPA WREAVVANWD YSPGTVEYFK QKIRLFKERG YEQIVCPGLW NWDRYYPDFD AALANVKSFL QAAREEGVKG FMVTAWGDDG EECLYSFLYP LILASMEYAE GNGRWEEKWL ALSGEPREVL EVRKALGKGE VANYVKRVLF SPTDEVKGLP VFDEWRKALE LAERVRLPPD LEFVKRCLEV GLRKVEGKAT AADLLGLASL YADLWLRERK PANLARVYAR FYSAAALGRA PKARRRAKPA SSPHKRV
|
| |
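The gene and protein sequences above can be related with a minimal sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGGGTGGGG AGCCGGTCGT TGTGCCCGAG"
print(translate(prefix))  # MGGEPVVVPE
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.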