Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0698 |
Symbol | |
ID | 4602013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 648295 |
End bp | 649722 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773472 |
Product | hypothetical protein |
Protein accession | YP_920103 |
Protein GI | 119719608 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.482742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAGGG AAATGATGGG GTGTGAGAAA AACGTGGCGG CTGAGGAGGC AGGGTTCGAA GAGTTCGACG AGTGGTTCAT TCGCAGGAAG ATCGAGGAGC TCCTGAGGAA GGCGCTGTTT CCGAGCGACG GCTATAGGGG GCTCGAGGCT ACTCGCAACG AGGTTGCGCG GGAGCTTGCG GAAATAAGGG GCAACTATGC TCGGTACGGC GAGGCCTACG TAGAATACGT TGGCAGGCTC GGGGAAGGCG AGGAGAAGGC GATCAAAGAG GCTGAGAAAG CGATGAGGGA AGCCCTGGAG CACGTAGACG AGCTCGAAAT CAGGAGGACA GGCAGAGCGC ACTGGAAGGC TAAGCTTCCA GGCAGGAGTT GGAGGCTATA CGTACACTTG AAGCCTAGCG GGTACTGGGA GGTCGAGATC CGTTTACATC TCAGAGTCGT CAAGCTCAGG CTACCCGATA CCCTGAGGCT CCCACCTGAG CTACTTAGAG CCGCCCAAGA GGGCTGGATC ATGGGAGACG CATCGTACCG CGCGGACCGC GAAGAAGTCA CGATGACTAC AGCGCAGACA TGGCAAGTAG CCTCCTTCCC TGGCTTCTGG CCGGGGAAGG AGGTGGCTGT CTACGTTGAT AGAATCGAGA TCCACGAGAC GAGAGTCAGC GTTAAGTGGT ACGTGGTTGT TAAGGGTGTT CGCGACGCGC CTCGCTGGTG GAGTCTCTCC AAAAAGGAGA AGCAGGGTAT TATCTTGGCG GAGATCGAGG CGGCGAACAG AGGAGAAATC GACATTTTCA GGGCGCTAAA GCTCGCATTG CTATACGTTA CGGATGGAAT GTATCCAGGG TCAAGCAACA CAGCTAGACA TGTGCTGGAT TTCGCGGTCG GTCAAAATTC TCGCCGAGTT AGAACCGAGG GTGCGGTGAA GGTTGCGAGG CTTCTCTACG AGAAAGTACC GCAACTCTTA GCGTACATGG TTGCGTCAGG CTGCCAGAAA GCAGAGTTCT TAGCGAGCCT GGCATCCGTG AAGCCGCGAC ACTACGCACC TCGCTACCTA GAGGTTGCGG GTGTCAAAAT GACCTTGCTA CTCGTAGGCG CTTCACGCGC GCTTGCGGCA GTGGTGTACG TAACCGAGGA TAACGAGGAG ACGCTTAGGG GTTTCCCTGA GAGGGCGAGG CGGGAGGGCT TAGAAGTCAG GAAGGTGAAG GTGAGCAAGG GGCGCTGGGG TTATCGCGCC GGGCAAAAGG AGTTGCTAAG GTATGCCGAT AAGCGCCCAA TAGTTTACGA CACACTCATA GCGTTCGTGG AGGAGAGACT CGGAGCAATG CCTCTCAATC ACCCAGCAAG GCCGAGTGTG GAGCGCCTCC TGGAACGCCT AAAAAAGGCG CGCGAAAGAG CGCTTAGAAA GCTGGGGGAC CAAGACGCTA AAGAGTAA
|
Protein sequence | MGREMMGCEK NVAAEEAGFE EFDEWFIRRK IEELLRKALF PSDGYRGLEA TRNEVARELA EIRGNYARYG EAYVEYVGRL GEGEEKAIKE AEKAMREALE HVDELEIRRT GRAHWKAKLP GRSWRLYVHL KPSGYWEVEI RLHLRVVKLR LPDTLRLPPE LLRAAQEGWI MGDASYRADR EEVTMTTAQT WQVASFPGFW PGKEVAVYVD RIEIHETRVS VKWYVVVKGV RDAPRWWSLS KKEKQGIILA EIEAANRGEI DIFRALKLAL LYVTDGMYPG SSNTARHVLD FAVGQNSRRV RTEGAVKVAR LLYEKVPQLL AYMVASGCQK AEFLASLASV KPRHYAPRYL EVAGVKMTLL LVGASRALAA VVYVTEDNEE TLRGFPERAR REGLEVRKVK VSKGRWGYRA GQKELLRYAD KRPIVYDTLI AFVEERLGAM PLNHPARPSV ERLLERLKKA RERALRKLGD QDAKE
|
| |