Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0943 |
Symbol | |
ID | 4601072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 893994 |
End bp | 895250 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639773721 |
Product | hypothetical protein |
Protein accession | YP_920346 |
Protein GI | 119719851 |
COG category | [S] Function unknown |
COG ID | [COG1602] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGGAGC CGCGGTTTCC CGGCGTTGTC TCGCGCCCGT CCCCGTACCT CTGCGCCAGG TGTAAGGGCT CGAGGAGGCT CTGCGGGCTA CCGTACTGCC CCATACTCGT CAGGGCTAGG GAGCTCGCGG GGGTCTACGA GAGGGTGAGG GGGCGCAGGG AGCTCGAGGC GCCGTCTCCC CCCTCGGTGC TCGTGGGCGA GAAGGGCTAC CCGTACATAC GGGTGGGCGT TAACCTCGTG GGCGGGGAGG AGGGGTCTCC GAGCCTCTAC GAGGATCCGG GAGCGTGGTG GGGGAGGCTC GACCTCTACG AGGTCTTGAG GCTGAGGGCC TCCATGGTCT ACTCGTACGG CGTGTACAGC GCGTCGAGGG CTGTTGGCAG GGTCGGGGAG TCAGTCAGGG AGGCGGCGCT CTCGCTGAAG CCCGTGGAGT CGGAGGCGGT GTTCAGGCGT CCCCCGGACT TCTCGATGCG CTTCGACCCG CTATTGAAGC CCGTCGGCTT CTCCGCGGAG GTCGAGAGGC TGTCGGTCGT CAGCAACCCC TACATCCCCA GGAGGGTGGA CCAGCTCATC GAGGACAGGG TTAAGGCGAG CCGCGCCGCC GTGGAGCTCT ACGAGAGGGG GTTCGACGTC TACTACATCC AGAGGGTTCT CTCGTCGGGC GCGCTCGGAG TATCCAAGAA GTTCGTGCCC ACCAGGTGGG CTATAACGGC TGTCGACAGG CTGATAGGAG ACTACCTGCT GGGCAAGGTC AAGGGCTACC CGGAGGTCTC CTCCTACGAG CTGTACCACT CGTCGTACAT CGGCAACTAC TACAGCCTGC TCCTCATGCC CGGGAAGTGG TCGCTCGAAA TGGTCGAAGT CTGGCTACCG AACTCCGTGT GGGTCCCGGG CTCCGAGCCC TACGTATCCA CCGTGCACGA GTGGAGCGAC GGGAAGCCGA GCGGGGAGGA CGGGGGCTAC GAGGCTATCC GGCTAGCTGT GCTCGAACAC CTCGCGAAGC GGGGAAGGGT TGCCAGCGTC CTTGCAGTGA GGGAGATCAC GCCCGAGTAC TTCGCGCCCG TTGGCAACTG GCAGATACGG GAAAGCGTGA GGGCGGCGCT GAGGGGTGGC GGCGAGAGCT TCCCGAGCCT CGGGGAGGCT CTCGAAGCGC TGAAGGAGAA GCTCAGGGTA CCCCTGAACT TAGTGCTCTC GAGGAGCACG TTGCTCGCGG CGAGGACTAG GCAGCGCTCA ATCACCGAGT TCCTGCACCG CGAATGA
|
Protein sequence | MEEPRFPGVV SRPSPYLCAR CKGSRRLCGL PYCPILVRAR ELAGVYERVR GRRELEAPSP PSVLVGEKGY PYIRVGVNLV GGEEGSPSLY EDPGAWWGRL DLYEVLRLRA SMVYSYGVYS ASRAVGRVGE SVREAALSLK PVESEAVFRR PPDFSMRFDP LLKPVGFSAE VERLSVVSNP YIPRRVDQLI EDRVKASRAA VELYERGFDV YYIQRVLSSG ALGVSKKFVP TRWAITAVDR LIGDYLLGKV KGYPEVSSYE LYHSSYIGNY YSLLLMPGKW SLEMVEVWLP NSVWVPGSEP YVSTVHEWSD GKPSGEDGGY EAIRLAVLEH LAKRGRVASV LAVREITPEY FAPVGNWQIR ESVRAALRGG GESFPSLGEA LEALKEKLRV PLNLVLSRST LLAARTRQRS ITEFLHRE
|
| |