Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0938 |
Symbol | |
ID | 4601067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 889347 |
End bp | 890756 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639773716 |
Product | hypothetical protein |
Protein accession | YP_920341 |
Protein GI | 119719846 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG0492] Thioredoxin reductase [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.903873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAATAA CGGAGCATCC CATACTGGAG TTCAGGCGGG GCTCCCCCGT AAAGTTCCGC TTCGACGGCG AAGAGGTAGA GGCTTTCGAG GGGGAAAGCA TAGCGGCGGC TCTCTGGGCT TCGGGGATCA GGGATTTCCG GCGCGGGGAG CAAGGGCCCC AGGGCCCTTT CTGCATGATA GGCTACTGCT CGGGTTGCAT GGTGCGCGTA GACGGCAGAA GCAGGGTTAG GGCTTGCCTC GAGCCCGTGA GGGACGGCGC TGTCGTCGAG AGGGAGGATA AGCCGCTACC GTCTGGGGTT GTCGGGGAGG CGGGCGAGGC GGGCGAGCTG GACGTCGACG TGATGGTAAT TGGTTCCGGG CCTGCCGGTC TCTCGGCCGC CCTCGCCAGC GCTTCCGCCG GCCTGGAGGT ACACGTGTTC GAGCGGCATT TCCGACCCGG CGGCCAGCTC GTGAAGCAGA CGCACAAGTT CTTCGGGAGC GGCGAGCTCT TCGGGGGGTT GAGGGGCTTC CAGATAGCGG AGAGGCTCGT ATCCGAGGCG GAGAGGGCTG GCGTGAAGAT CCACACGAGG TCCCCCGTCC TGGGCTGGTT CGGGGAGGGG GTCTTCGCGG TGAACGAGGG CGGGAGGTTG CTCAGGGTTA GGGCTAAGGC AGTCGTCGTG GGCACGGGGG CTGTCGAGAG GTTCCTGCCC TTTCCGGGGA ACACGCTTCC AGGCGTGATG GGCGCCGGCG CTGCCCAGAC GCTGATGAAC GAGTACGGGG TAAAGCCCGG CGAGAAGGCG GTGGTCGTCG GCGCGGGGAA CGTCGGGCTG ATAGTCTCCT ACCAGCTCCT CCAGGCCGGC GTAAGCGTAG AGGCCGTCGT GGAGGTTAGG CCGGAGATAG GAGGCTGGTT CGTCCACGCG GCGAAGCTGA GGAGGCTGGG CGTCCCCATA CTCACGGAGC ACACTGTCGT GAGGGCTGAG GGCAGGGGGA GAGTCGAGAG GGTGGTCATC TCGAGGGTCG GGAAGGACTT CCAGCCGCTG AAGGAGTACG AGAGGAGCGT AGAGGCGGAC CTCCTGCTCC TAGCCGTGGG GCTGACCCCT GAGTCCAGGT TGCTCGCGGA GATGGGTGCC AGGATGACGT GGTCGACCGA GCTTGGAGGC TACGTGCCGT ACCGGGACAG GTACATGGAG ACCAGTATCC CCGGGGTGTA CGTGGCGGGG GACGCCTCGG GGATCGAGGA GGCTACGACC GCCCTGCTGA CGGGCAGGGT GGCGGGGCTC TCGGCCGCGA TAAGGATCCT CGGGGAGAGG GGTGAGCTCG TGGAGGAGAG GGAGAAGGCT CTGAGGATGC TCGACGAGAC CAGGAGGACT CCCTTCTCCG CGCGCGTCGT GGAGGGTATA CGCAGGGTGA GCGTCGGTGT TCAGGCGTAG
|
Protein sequence | MRITEHPILE FRRGSPVKFR FDGEEVEAFE GESIAAALWA SGIRDFRRGE QGPQGPFCMI GYCSGCMVRV DGRSRVRACL EPVRDGAVVE REDKPLPSGV VGEAGEAGEL DVDVMVIGSG PAGLSAALAS ASAGLEVHVF ERHFRPGGQL VKQTHKFFGS GELFGGLRGF QIAERLVSEA ERAGVKIHTR SPVLGWFGEG VFAVNEGGRL LRVRAKAVVV GTGAVERFLP FPGNTLPGVM GAGAAQTLMN EYGVKPGEKA VVVGAGNVGL IVSYQLLQAG VSVEAVVEVR PEIGGWFVHA AKLRRLGVPI LTEHTVVRAE GRGRVERVVI SRVGKDFQPL KEYERSVEAD LLLLAVGLTP ESRLLAEMGA RMTWSTELGG YVPYRDRYME TSIPGVYVAG DASGIEEATT ALLTGRVAGL SAAIRILGER GELVEEREKA LRMLDETRRT PFSARVVEGI RRVSVGVQA
|
| |