Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0468 |
Symbol | |
ID | 4600996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 427168 |
End bp | 428466 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773236 |
Product | hypothetical protein |
Protein accession | YP_919880 |
Protein GI | 119719385 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGGGT ATGAGGAAAT CGAGAAGACT GTTAGGGAGG AGACGTGGAG GAGGGCGGCT TTAAGCCTCT ACGCCACCAG AACAGAGGAT AAGAGGAGGT CCAGGGGTAA GAAGAAGAGG GGCGAGATAC ACTACCGCGG CCTCTACGAT ACGGTGTCTG GGATTAACTG GGACTTCACC AAGTTCCTCG TAAACGGCTA CAGCGTCGTG CCGGACAGCG TTTACCCGAG GTTTTACAGG TTCATCGACT ATGACTTGAG GAAGTATCTC TTGCTGAACG ACGACGAGAA GCCACGCGAA GGAGGCGCGG TAATGGAGCT CAAAGGAAGG TTGCAAGCAA TCGTCGACGC CGGCGCCGAC GGTCTTAGAG CTGAGAAGAA GGGTAAAGTC TGGCATGTAT ACATACCTAG AGAGAACTGG CACGTGAGGG TCTCGAAGCC TACGCATGGC TGGTCCGTAC ACATCCCATT GGAGGGCTTC TGGGTCGAAT CGGAGTTTCC CCAGGTTCTA GTGAATACTC CGAGCGATGT TCTCAGGAGC CTGCAGAAGG GGTGGATCCT TACGGATGTG ACACCCCCTC ACGGGCGCTA CAGCGACGTA CGTTTCGGTA CCACTCAACC GTGGCAGTTG CCAGCGACGC TCGCAACCTT TCCAAGCGAC GATGCCAGGC TCGGCGTTAC GGCGGGCATA CTCGGTAGCA CCAGGCTGAG CTTTCAGTGG CAGGTGCGTG TCTACGGTTA CGAGGAGGAG CTGGGCTGGG CCTCCACGCT CATTGGCGCT ACGAAGCGCG TGGAGTTCCG CAGGTTGGTC GAGGAGTGCA AGGAGCTCAA CGGCGACCCA GCGTCTCTTT TCACTACCTT CCTGGGAGAC GGCTACCTCG CATTCTTTCT AAGGCTTCGG ATGCTCCACT TCAGGATAGG CAGCGAGGTT TTCTACCTCC CAGCTGAGAG CGCCATAATC AACGCTAGGC TTGCCGTGGA GAGGGCTAGC GAGTACACCA AGTTCGTCTC ATTGGTGACG AAATGCGCTA AGATCAAACA CTTCCTATTC GTCGGCTTCG GATTACCTCG GAAGAAGGGT AGGAAAAACG GGCAGAGAAA CAACCCGTTC TACGCCGAGA TAGCAGGGGC ACAGCTACAC CTAGCCTATG TATCCAGCAC CAACAACATT TACGCGAGGA TCGCAGTCGA AGCTGTGCCT TCGGGCTGGG TGGAGGAGGC ACGCGCTCAA GGCTGGGACG TCCGGGTGGT TCGAATGGGT GGGGTAGGGA GTACTACCAG GTTACACACG CCTCGCTAA
|
Protein sequence | MMGYEEIEKT VREETWRRAA LSLYATRTED KRRSRGKKKR GEIHYRGLYD TVSGINWDFT KFLVNGYSVV PDSVYPRFYR FIDYDLRKYL LLNDDEKPRE GGAVMELKGR LQAIVDAGAD GLRAEKKGKV WHVYIPRENW HVRVSKPTHG WSVHIPLEGF WVESEFPQVL VNTPSDVLRS LQKGWILTDV TPPHGRYSDV RFGTTQPWQL PATLATFPSD DARLGVTAGI LGSTRLSFQW QVRVYGYEEE LGWASTLIGA TKRVEFRRLV EECKELNGDP ASLFTTFLGD GYLAFFLRLR MLHFRIGSEV FYLPAESAII NARLAVERAS EYTKFVSLVT KCAKIKHFLF VGFGLPRKKG RKNGQRNNPF YAEIAGAQLH LAYVSSTNNI YARIAVEAVP SGWVEEARAQ GWDVRVVRMG GVGSTTRLHT PR
|
| |