Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0791 |
Symbol | |
ID | 4601140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 743342 |
End bp | 745138 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773568 |
Product | glycoside hydrolase family protein |
Protein accession | YP_920196 |
Protein GI | 119719701 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTCTCC ACTTTAGTGG TCCTCTCTTG ATGTACTGGC GTGAGCTTTA CCCCGACTTC CTCGCTAGGC TTAGAGAGAC TGTCTCGAGG AGCGAGTTCG AGGTTCTCGG CGGTACGTAC TCTGAAAGCG TGCTGTCGCT GCTTCCGTGG GAGGACAGGG TCTTGCAGCT CAAGAAGGGG CGCGAGCTGG TAGAGGAGAC CTTGGGGGTC TCTCCCCGGG GTCTCTGGAT CCCCGAGAGG GTCTGGGACC CCACGCTTCC CCCCGCGATA AGCGAGGCTG GGTACAGCTA TGTGATAGTA GACGACGAGG TCGGGTATAG GTCTGGTCTT TGGAAAGACG ACGTCCACAG GGCCGTCTTG ACGGAGTACA GCGGGAGGAG GGTGGGAGTG CTCTTTATCG ACGGGCCCGT GAGGTACATA CTGCCGTGGA AAGCACCCGG AGAGGTTCTA GGCTACATAA GATCCTTCGC CACGGAGGAC GGGCGACTCT ACGTGCTGTG GGGCTCCGAC GCCGAGAAGT TCGGAGAGTG GTGGGACGCG AGGGCCGCCG AGCAGTGGCT ACGCACGTTC TTCGGCATGC TTAAGGGGGA CTCGTCCGTG GCGCTCCTAA CGCCGTCCGA GTACATCCTG AGGCACGGGT ACCAGGGGCT CGCGTACCTC GCCCCGGGGA GCTACGACAA GATGATGGAG TGGAGTGGAG GCTACTTCCC CAACTTCCTG AGGAAGTACA GGGAGACTAA CAACATGCAC AAGAAGATGC TGTACGTGAG GGGAAAGCTA TCCCTTCTCA AGGCGTCGAG GGAGGCGTGG GAGGAGTACC TCAAGGCGCA GTGCAACGAC GCTTACTGGC ACGGGCTGTT CGGAGGAGTC TACATACCGT TCCTCAGGCA GGCGGTGTTC GAACACTTGG TAAGGGCGGA GCGCTTAGCC GAGGAGGAGT CCGGCTACTA CCTGGGGAAC TCCTCCGTCG TTAGAAGCCT GGACTTCGAC TTCGATGGCG TGGACGAGGT GCTCATCGAG GAGAAAGAGG TCAACGCCTA CGTAAAGCCG AGCGACGGAG GCTCTCTCTT CGAGCTAGAC GTAAAGATGC CGGGGAAGGA GCACAACCTT CTCGCCACGA TGTCGAGGTA CAGGGAGCCG TACCTAGAGG ATCAGAAATC GGTTGTCCCC GACTGGTACA GGCGCGTCGC CTTCAGGGAG CACATCTGGC GCAAGGACGC CTCGAGCGCC GACTGGATTA ACAACACGCC GTTCGTAGAC GTGAGCGACT TCGCCCTGGG CAACTACATC GTAGAGGCTG TAGAGGGCAA CAAGCTAGTG CTCTCCTTTA CGGGTAGGGA CTGGAGCGAC AGGAGGAGGC CGGCGCGGAT ACACCTCGTG AAGACCTACG AGGTGCTGGG CTCCCAGAGG ACTGTCAGGG TGCGCTACAG GTGGCGCAAC ATGGAGAGAA GGTTCATAGA CCCCAAGCTC TCGGTGGAGG TCAGCCTGTT CCCCAGGCTG AGCTACGAGG AGGACTCCGA CCCTACGTAC ACCGTGGACG GCTCGCAGAG GTTGAGCGTG AGGGAGGGCT TCTCCTCTCC CTGGGCCAGG ACGGTGAGGG TAGAGTCGCC GGCCTTCCCG ACGGTCACCG TTGAGAGCTC GAGGCACGCA GAGGTATGGG TATCCCCCAT CCTCAGCTGG TACAGGACGG AGAAGGGGTT GCGGAGCGAG TACCAGGGGC TAGCCGTTTC CTTTAACTAC GCGGTAGCGC TAAACCCAGG GGAAACCTTC GAAACGGAGG TGTCCCTGTC TTGGTGA
|
Protein sequence | MTLHFSGPLL MYWRELYPDF LARLRETVSR SEFEVLGGTY SESVLSLLPW EDRVLQLKKG RELVEETLGV SPRGLWIPER VWDPTLPPAI SEAGYSYVIV DDEVGYRSGL WKDDVHRAVL TEYSGRRVGV LFIDGPVRYI LPWKAPGEVL GYIRSFATED GRLYVLWGSD AEKFGEWWDA RAAEQWLRTF FGMLKGDSSV ALLTPSEYIL RHGYQGLAYL APGSYDKMME WSGGYFPNFL RKYRETNNMH KKMLYVRGKL SLLKASREAW EEYLKAQCND AYWHGLFGGV YIPFLRQAVF EHLVRAERLA EEESGYYLGN SSVVRSLDFD FDGVDEVLIE EKEVNAYVKP SDGGSLFELD VKMPGKEHNL LATMSRYREP YLEDQKSVVP DWYRRVAFRE HIWRKDASSA DWINNTPFVD VSDFALGNYI VEAVEGNKLV LSFTGRDWSD RRRPARIHLV KTYEVLGSQR TVRVRYRWRN MERRFIDPKL SVEVSLFPRL SYEEDSDPTY TVDGSQRLSV REGFSSPWAR TVRVESPAFP TVTVESSRHA EVWVSPILSW YRTEKGLRSE YQGLAVSFNY AVALNPGETF ETEVSLSW
|
| |