Gene Information |
Locus tag | Tpen_1672 |
Symbol | |
ID | 4600924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1618086 |
End bp | 1620008 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639774445 |
Product | glycoside hydrolase family 42 protein |
Protein accession | YP_921070 |
Protein GI | 119720575 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3934] Endo-beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.590042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACGAAG CGTTCAAGTT CTTACTAGGA GTTAACTACT GGCCTAGACT ATACAACGTA AAAATGTGGA AAGAATGGGA CGAAGAGAGC CTAAAGAAAG ACATAGAAAA AATGAAAGAA CTCGGCGTTA GAGTAGTAAG GATATTTTTA AGGGATATAG ATTTTGCGGA TGAGAGAGGA ATTCCAATTG AAGAGAGTCT GCAGAAGCTA CAAAGATTCC TCGATTTGCT TCACGAGAAA AACCTCCAGG CATTCGTAAC GTTACTCGTA GGACACATGA GCGGAAAAAA CTTCCCAATA CCGTGGACCA GCTTCGACAG CCTTTACACC CCCTCTTCCG TGGAGAAGAC CGCTACTTTT GCGAGAAAAA TAGCAGAAAG GCTGGCATCA CACCCTGCAC TTGCTGGCTG GATACTAAGT AACGAGCTCA GCTTGGTAAA GAGAGCTACG ACAAGAGAAG ATGCCCTCAG GCTACTCGAA GCATTTACAA AAACTATGAA ATCCGTAGAC CCAAATCACA TCGTATCCAG CGGAGACATA CCAGACAGCT TTATGCAAGA AACCCCTAAT GTACGTCACC TCGTCGACTA CGTTGGGCCA CACTTGTACC TATACGACAC CGATCTCGTA CGGCTTGGAT ACTTCTATGG GGCAATGCTT GAACTCTTCT CCAACGCAGG AGACCTGCCA GTCATACTTG AAGAGTTCGG CTTTAGTACA CTACAATTCA GCGAGGAAAG CCACGCGCGA TTTGTGGAGG AAATTCTTTA CACATCTCTA GCTCACGAAG CTTCTGGAGC GTTTATTTGG TGTTTCTCAG ACTTCACAGA AGAGAGCGGT GAGCCATACG ATTGGAGACC TTTAGAGCTC GGCTTTGGTT TACTGAAGAA AGACGGTAGC GAAAAACTAG CAGCAGACTC TTACAGGAAC TTTTCTCATG TGGTCGAAAG AATAGAAAAG CTCGGACTTC ACTCTAAGTA CAAACGTTTA TCAAGCACTT TTGTAGTTTA TCCATTTTAC TTATTCAGAG ACTACGAGTT CATATGGTAC AAAGAGTCAC TAGGCTTTTG GGAATCCATA AAACCGCACT TGATGAGCTA CTCGCTTCTA TCCGCCTCTA GCGTTCCTTC TCGAATGGTT TACGAGCTAG ACCTGAAAAA GATTTTAAAA AGCGCTAAGC TAGTAGTCCT TCCTTCTGTA GTAGCTACAC TTGCTTCTAC GTGGCGCAAC CTTCTAGAAT ATGTAGAGCT CGGCGGAACC CTCTATTCAT CCGTTATCAG GGGAGCCGGT GCTTTCAAAG CCCTCCACGA TGCGCCGACA CACCTTTGGA ACGAGCTGTT TGGCGTGGAA AACGTTCTAG AGGCAGGCTC CATGGGACGC AAGATCTTCG GAGTCGTTAA ATTGAAATTC GTCAGGAAAT TTGGCAACCT CAGTGAGGGG GACGAACTAT TACTAAAAGT ACCAGAAAGC ATCTATACTT TCAAGGCGCA AAGCACGGAC TCGGACGTAA TAGCCTTGGA TGACGAAGGA GAACCGGTAA TCTTCTTCTC TCGAAGGGGG CGTGGCAAGA CCATTCTATC GCTGATACCT ATAGAGGTGA TATTACAGGC ACAGGAAAAC GCTCAATGGC ACGAAGGAAC AATATTTTAT GAACAACTTG CCTTTGTTTC AGAGGTAGAA AGACGTTATG CTTCAAAGGA TCCTAGAGTC GAGCTACAAG TTTACACGGG AGAAAAGGAC GATCTTTTAA TAGTGATTAA TCACAGCAAT GAAAATGTGG AGACAAGCAT TACGAGTGCT 
ACAAGAATCG TAGAAGCACA AGTAATTGGC GGTAAAGCAA GGCTATTACC AGAGTCGAAG AGAGAGATGA GAGCTGTATT TCCCCCAAAG TCGGGCTCCA TTATTCGTGT CGTTAAAACC TAG
|
Protein sequence | MDEAFKFLLG VNYWPRLYNV KMWKEWDEES LKKDIEKMKE LGVRVVRIFL RDIDFADERG IPIEESLQKL QRFLDLLHEK NLQAFVTLLV GHMSGKNFPI PWTSFDSLYT PSSVEKTATF ARKIAERLAS HPALAGWILS NELSLVKRAT TREDALRLLE AFTKTMKSVD PNHIVSSGDI PDSFMQETPN VRHLVDYVGP HLYLYDTDLV RLGYFYGAML ELFSNAGDLP VILEEFGFST LQFSEESHAR FVEEILYTSL AHEASGAFIW CFSDFTEESG EPYDWRPLEL GFGLLKKDGS EKLAADSYRN FSHVVERIEK LGLHSKYKRL SSTFVVYPFY LFRDYEFIWY KESLGFWESI KPHLMSYSLL SASSVPSRMV YELDLKKILK SAKLVVLPSV VATLASTWRN LLEYVELGGT LYSSVIRGAG AFKALHDAPT HLWNELFGVE NVLEAGSMGR KIFGVVKLKF VRKFGNLSEG DELLLKVPES IYTFKAQSTD SDVIALDDEG EPVIFFSRRG RGKTILSLIP IEVILQAQEN AQWHEGTIFY EQLAFVSEVE RRYASKDPRV ELQVYTGEKD DLLIVINHSN ENVETSITSA TRIVEAQVIG GKARLLPESK REMRAVFPPK SGSIIRVVKT
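The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1618086
END_BP = 1620008
GENE_LENGTH_BP = 1923
PROTEIN_LENGTH_AA = 640


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the fields listed in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons print True: 1620008 - 1618086 + 1 = 1923 bp, and 1923 / 3 = 641 codons, of which 640 encode amino acids.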