Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0839 |
Symbol | |
ID | 4601548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 789356 |
End bp | 790315 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773616 |
Product | xylose isomerase domain-containing protein |
Protein accession | YP_920243 |
Protein GI | 119719748 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1082] Sugar phosphate isomerases/epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.276132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAGGC GAAAATACAC GATTCGCATG TTCGCAGTTT CTCTCAGAAA GAGAGGCGTA GACGCCGTGT ACTCTTTCCC CTGGAAAGTT AGTATCGTTG CGTTTATGGC CAACCCGCCC GTACTTAAGG AGGATATTCA GGCCTTAAAG GAGACATTCT CCACTCTCTC CTCGGACAGG TTCTTCGACG TGATAGAGGT TCAGCTCCTC GGCGACGAGG CTTGGAGGGC TGTTGAGGGG TTTGTGAAGG CAAGCGGCGT GGAGGTTGCG AGCGGGGTCC AACCCCTCGT ATTGATGCAG GGCTTTAACC CCTCTTCCCT CGTGGAAGCT GAGAGAAAGA AAGCTGTAAG CAAGCTCGTC GAAGTAGTGA AGACCTCGGC AGAGAGGGGT ATAAGCAAGG TTGCATTTTC CAGCGGGCCG GACCCGGGCC CCGAGAACCG CGAAGCAGCC AAGGACGCCC TGATTAAGAG CTTGAAGGAG ATAGCGGGCG AGGCGAAGAA GTTCGGGGTA ACAGTCATAC TGGAGACCTT CGACAGGGAC TGGGACAAAA AGCAACTCAT AGGGCCTATC AGGGAGGCGG TAGAAGTGGC AGAGAGCGTA AGGGAGGAGC ACGGAAACTT CGGCCTGCTC TGGGATCTGA GCCACGCCCC CATGCTCGGC GAGAAGCCCT CCGACCTCAA GTTAGCGAAG AGCTACCTGG CGCATATCCA CATAGGGTGC GCTAAGCGCC TACCCGACGG GCGGCTAGTG GACTGGCACC CGGGCTTTTA CAGGCCCGGA GCAGTCAATG GGGTCGAAGA CGTGAAGGAG CTCCTCAAAG TACTGCTCGA GATAGGGTAC ACAGGGGCCG TGGGCTTCGA AGTAAAGCCG GAGGAAGGTC AGCACTGGCG CGAACCCGTG GAGGCCGCGA AGGGCGTGCT CTACGAGGCA TTCCTCAGGC TCGTTGAGAG CGAGCTCTAA
|
Protein sequence | MPRRKYTIRM FAVSLRKRGV DAVYSFPWKV SIVAFMANPP VLKEDIQALK ETFSTLSSDR FFDVIEVQLL GDEAWRAVEG FVKASGVEVA SGVQPLVLMQ GFNPSSLVEA ERKKAVSKLV EVVKTSAERG ISKVAFSSGP DPGPENREAA KDALIKSLKE IAGEAKKFGV TVILETFDRD WDKKQLIGPI REAVEVAESV REEHGNFGLL WDLSHAPMLG EKPSDLKLAK SYLAHIHIGC AKRLPDGRLV DWHPGFYRPG AVNGVEDVKE LLKVLLEIGY TGAVGFEVKP EEGQHWREPV EAAKGVLYEA FLRLVESEL
|
| |