Gene Information |
Locus tag | Tpen_0494 |
Symbol | |
ID | 4601328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 446208 |
End bp | 447377 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639773261 |
Product | Fmu (Sun) domain-containing protein |
Protein accession | YP_919904 |
Protein GI | 119719409 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00451] uncharacterized domain 2 |
|
|
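The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Tpen_0494).
length_bp = gene_length(446208, 447377)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result agrees with the listed Gene Length (1170 bp) and Protein Length (389 aa).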
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.201395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTAGCG TTGGTGCAGT AGCGTCTAGG TATGGTTACA GGGTTGACTT CGTGGAGTAC CTTGCGCGTT TTTTCAGCCT AGGCTACATC GAGGATCTCT TTAGGAGCTT GGAGACTCCG GGATCCAGGT ATTTCTTCAG AGTTAACACG CTGAAGGTAC GTCCGGAGGA CCTTCTCGAA GAGCTCCGCG AAGAGGGGTT GAAGGTATAC AGGCATGATC GTTTACCGGA GGCTTTCTAC GTGCCAGTAG AGGGGCCTTT CGAGGTCAGA CTGCTACCTG GAAAGGTCTA TGTCGATAAA AAGACGGCGG AAAGCGTGTA CGTGGGTGCC AACGTATTCG CACCCGGAGT CACGAAAGTA GAGAACGCGA GGGAGGGGGA CCTTGTATCC GTGATTGCCC CTGGAGGACG TGTAGTTGCA GAGGGAGTTT TGTCTATGGA TCCTTCCAGG GTGTTCTCGG AACGCAAAGG GCTCGCCGTA AGGGTTGTGA GGTCTGTTTA TAGGGCTGTA TCGCTCCGCG AGACTAGGTA CTTTGACGAG GGGCTAATCT ACCATCAATC TTTACCCTCT ATGGTGGCTG TACGCCTGCT AGACCCTCAG CCCGGCTGGA CGGTGCTTGA TATGTGTGCC GCTCCAGGTG GGAAGACTAC GCATGCGGCA CAACTGATGG GCGACCACGG GGAGGTTATC GCTGTTGACA GGACGAAGTC GAAGGTTGAT ACGATAATGG AGCATGCGAG AAGGCTGGGG CTTAAGTCCG TGAAAGGGCT TGTTTACGAT AGCCGTTACA TATCGGAGTA CTTGGACAAG GATAGCGTAG ATGCAGTTAT AATTGATCCT CCCTGCACGG CGCTCGGAGT TCGCCCGAAG CTGTGGTACG AGAGAGGCTT TGACGACGTG TTGAAGCTAT CAGATTACCA GAGACAGTTC CTCAGAGAGG CCGCCAAAGT TCTGCGTAGG GGTGGGCGCT TGCTATTCAC TACGTGCACT ATATCGCCTT ACGAAAACGA GTTCAACGTC ATATTCGCGG CTTCGTATCT AGGACTAAAG CCTTTACCCC TGAGCTTTCC CCCCGTAAAG AGCGGCTTAC TGGGCACGGG TTCCCTACAG TTCGTGCCTC ACGAACACGA CACCCCGGGG TTCTTCATAG CACTCTTCGA AAAACGTTAG
|
Protein sequence | MSSVGAVASR YGYRVDFVEY LARFFSLGYI EDLFRSLETP GSRYFFRVNT LKVRPEDLLE ELREEGLKVY RHDRLPEAFY VPVEGPFEVR LLPGKVYVDK KTAESVYVGA NVFAPGVTKV ENAREGDLVS VIAPGGRVVA EGVLSMDPSR VFSERKGLAV RVVRSVYRAV SLRETRYFDE GLIYHQSLPS MVAVRLLDPQ PGWTVLDMCA APGGKTTHAA QLMGDHGEVI AVDRTKSKVD TIMEHARRLG LKSVKGLVYD SRYISEYLDK DSVDAVIIDP PCTALGVRPK LWYERGFDDV LKLSDYQRQF LREAAKVLRR GGRLLFTTCT ISPYENEFNV IFAASYLGLK PLPLSFPPVK SGLLGTGSLQ FVPHEHDTPG FFIALFEKR
|
| |
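The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing, computes a GC fraction, and translates a CDS. It relies on two assumptions: translation table 11 assigns the same amino acids to codons as the standard code, and the first codon (GTG here) is read as the initiator methionine. The helper names are hypothetical, and the sequence is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is forced to M (assumption)."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues:
        residues[0] = "M"  # alternative start codons such as GTG still encode Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence shown above.
print(translate_cds("GTGTCTAGCG TTGGTGCAGT AGCGTCTAGG"))
```

For these first 30 bases the translation gives MSSVGAVASR, the first block of the listed protein sequence. Passing the full Gene sequence value to `translate_cds` and `gc_fraction` would allow the same comparison against the complete Protein sequence and the GC content field.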