Gene Information |
Locus tag | Tpen_1348 |
Symbol | |
ID | 4600872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1298935 |
End bp | 1300320 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774123 |
Product | hypothetical protein |
Protein accession | YP_920748 |
Protein GI | 119720253 |
COG category | |
COG ID | |
TIGRFAM ID | |
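
The coordinate and length fields in this record are related by simple arithmetic. As a minimal sketch (the variable names are illustrative and not part of the database record; it assumes 1-based inclusive coordinates and an unspliced CDS ending in a stop codon), the relationship can be checked as follows:

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1298935
end_bp = 1300320

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced and its final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1386 461
```

Both values agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields listed in the record.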
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.233961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATGA AGGCAAAGAC CCTCGCACTG CTAGTAGCAG TAGCAGTGGT CGCCGCCTAC GTGATGTTCA CGGCGCCTAA AATGCTACCC CAGCAGCCGG CTACAGCGCC CGCCGAGAAG CCTAGAAACG CCACAGTCGC TCTGAGGGTT CACGGCGTTG GGGCCCTGCT CGTCAACGGC TCAAGCTACG TGAACACCAC CCTGACGCTC AGGGTTCCGG CGGTCCTGGC TATCAACGCG TCGGCGCCGA AGGGCTGGAG GCTGAAAACG CTGATGGTGA ACGGGTCGCC CGTTTTACCT GGCGTAGCGA AGGTGTATGG GAACACTACC GTCGAGGCTG TGTTTGAAAG GGTTTTCTGC ACTGTTTACC TGCGCGCGAA CGTCGAGGGT GCTGGGGCTA GGGTTAACGG CTCCCTCTAC CGGCTCCCGG CGTCCGTCGA GGTTCCGTGC CCGTCGACAG TGCTCGTCGA GCCCGTAGCC CCGGGCGGCT ACAGGCCCGT CAACTCCTCG GCCGCGCTGG AGGTCTCCTC GAACTCCTCG TTGCTCCTAG TCTTCGAGAG GTCCAGGAGG GTGGCGCGGT TCGTCAACGT CAGGGTCCCC GTCTCCCTCA ACGGGACTGT CTACGCGGGG GACTTCGAGG TGGGCTTCGA GGGGGTGCTT AGGCTGAACC TCTCGCCTTA CGGCGTGGAC GACGCTGGTT GCGTGCCGTT CAACGAGACC GTCAAGGTGT GCCTCGAGGG GTGGAGGAGG CTCTCGACGA ACCAGACGCT CAGGGCGAGG TGGCTCGCGC TCAACCTCTC GGATAGCGAG GTATTCGAGC AGGTGTGGGG CTATGCGCCC GCGAAGCTCG AAGGAGCAGT CATAGAGCTG ATAGCCGGCA ACACCACCGT GAAGACCGTC GCGCAGCCCT CCAAGATGAT GGTCGTACCC TTCCGCGCCG TGTACAAGTA CCTGGGCAAC GGCTGGTTCT GGGTAAACGG CTCCGACTGG GTCTTCTACA TATCCATGCC TCCCTGGAAG AAGATAAAAG TAGAGCTCAA CTACACGGCT TGGGGAGGAG CATGGGGAAG GCTCGCGGTG GTCGTCAGGA ACGACAACCT ATACTTCGAC GTGGGCGCAG CCTTAGGCAA CACGCCTACG CTCACCTGCG TGATAGACAG GGGCATAATC GACCTCTTCT ACCCCTGGAC CTTTAAGAGC GAAGACGAGA TCAACCGACT CTTCATACAA ACGGACTACT CCAAGTACTT CTCCTGCGCA ACTTGGGAGA GGAGGACAGC CGCGGGGCCA TCGCTCCGCG TAGAGCCGGG AATGAAGCAG GGAGACCTGA GGTTCGATGG CACAGGGGAA GCCTACATAA GGATAACGAT ACTCGAGCAA CCCTAA
Protein sequence | MGMKAKTLAL LVAVAVVAAY VMFTAPKMLP QQPATAPAEK PRNATVALRV HGVGALLVNG SSYVNTTLTL RVPAVLAINA SAPKGWRLKT LMVNGSPVLP GVAKVYGNTT VEAVFERVFC TVYLRANVEG AGARVNGSLY RLPASVEVPC PSTVLVEPVA PGGYRPVNSS AALEVSSNSS LLLVFERSRR VARFVNVRVP VSLNGTVYAG DFEVGFEGVL RLNLSPYGVD DAGCVPFNET VKVCLEGWRR LSTNQTLRAR WLALNLSDSE VFEQVWGYAP AKLEGAVIEL IAGNTTVKTV AQPSKMMVVP FRAVYKYLGN GWFWVNGSDW VFYISMPPWK KIKVELNYTA WGGAWGRLAV VVRNDNLYFD VGAALGNTPT LTCVIDRGII DLFYPWTFKS EDEINRLFIQ TDYSKYFSCA TWERRTAAGP SLRVEPGMKQ GDLRFDGTGE AYIRITILEQ P