Gene Information |
Locus tag | Tpen_0873 |
Symbol | |
ID | 4600355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 822959 |
End bp | 824431 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773651 |
Product | hypothetical protein |
Protein accession | YP_920277 |
Protein GI | 119719782 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
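The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence listed in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 822959
END_BP = 824431
GENE_LENGTH_BP = 1473
PROTEIN_LENGTH_AA = 490


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 824431 - 822959 + 1 gives 1473 bp, and 1473 / 3 - 1 gives 490 aa, matching the table.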
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGTA GAGGCATGAC GGGGGGTGAA GAGGGCGAGG AGGTTGTTAG GAAGCTTGTC TACGGGAGGG CGGCTCAAAG CTTGTATGCC ACCAGGACAG AGGATAAGAG GAGGTCTAGG GGCGGGAAGA AGGGCAGAGT ACATTACCGC GGCCTGCATG ATGCTGTGAC CGAAATTAAC TGGGACTTTA CGCGCTTCGT GGCGCACGCG CTGAGTGTCG TGCCGGACGA GGTTTACCCC AGGTTTGGCA GGCTGTTTGA CTACGATTTG AGGCAGTACC TACTACTGGG AGACAGCGAG AGACCGAGAG AGGGAAGCGC GGTCGTGGAG CTCAAAGGAA AGCTACAAGC AATCGTCGAT GCAACAGAAG ACGGCCTTAA AGTTGAGAGG CACGGAAAGA TTTGGCGCGT CTACCCGCCC AGGGAGAACT GGCATGTGAC AGTCTCAAGG CCCAATGCAC ATAGCTGGAC TGTCCACGTC CCATTGGAGG GTTACTGGGT TGAGTCAGAG TTCCCGGAGG TACTCGCGAG GACTTCGCCC GACATCCTAA GAAGCCTGCA GAAAGGGTGG CTTCTTACGG ACGTAATGCC GCCACGTGAA AGTTACCTTT ACGTTTCCTT CAGCACTACC CAGGCGTGGC AACTTCCAGC CACGCTCGCC GACTTTCCAA GCGACGATGT CCACCTCGGC ATTAGAGCAG GAGTTCTCGG CTCCACCAAG ATGAGCATTA TGTGGGGAGT GAGCATCTAT GGCTATGCGG AGGAGCTTTC CTGGGCTTCA AGGCTTGTCG GCGAGGTTAA GCGCGCTGAG TACCGCAGTT TGGTCGAGGA GTGCAAGGCG CTTAATGGTG ACTCGGTGGC GCTGATAACC GCCTTCCTGG GGGACGGCAT CCTTGCGTTC TTCCTAAGGC TTCGATGGCT CTACTTCAGG GTTGGCAACG AGATAGTTTA CTTACCTACG GAGAGCGCTG TCTTTAACGC TCGTGTTGCT GTCGAACGGG CCAGCGAATA CGTAGCATTT GTGGGCAAAG TTACTAGGTG CGCAAAGGTT AGGCACGTCC TATTCGTTGC CTACGGAGCA CCGGGTAAGA GGGGCAGGAA GCCCGGGCAG AAGGTGGCTT GCCAGTGGCT CGGCCTCTAT GCGCCGGTCG CCGGCGCGTT GCTTAACCTC GTTATGGTTA CTGTCGGCGA CGAGTATGCC TACATCTACG CCCGTATACC CGTTGATAGC GCGCCTCCAG GCTGGTACCA GCGCGCGCTG GAGGAAGGCT GGGACGTCCG GGTGGTTCGC ATGAGTGACA GGGAGTACTA CCAGATCCCC CAAGATTCGT TATTTGAGCA CGCGGGCGAG AACCCGGAGC TCTGGGAGGC GCTATACCGC TTCGCAGTTG CAAAGGCGCA GGCTAAGCCG GCCGCGAGGA AGCTAGTCGA AGAACTGCTA AGATTTAAAC CAGCTGGGGC AAAGCAAGAA TAA
|
Protein sequence | MSGRGMTGGE EGEEVVRKLV YGRAAQSLYA TRTEDKRRSR GGKKGRVHYR GLHDAVTEIN WDFTRFVAHA LSVVPDEVYP RFGRLFDYDL RQYLLLGDSE RPREGSAVVE LKGKLQAIVD ATEDGLKVER HGKIWRVYPP RENWHVTVSR PNAHSWTVHV PLEGYWVESE FPEVLARTSP DILRSLQKGW LLTDVMPPRE SYLYVSFSTT QAWQLPATLA DFPSDDVHLG IRAGVLGSTK MSIMWGVSIY GYAEELSWAS RLVGEVKRAE YRSLVEECKA LNGDSVALIT AFLGDGILAF FLRLRWLYFR VGNEIVYLPT ESAVFNARVA VERASEYVAF VGKVTRCAKV RHVLFVAYGA PGKRGRKPGQ KVACQWLGLY APVAGALLNL VMVTVGDEYA YIYARIPVDS APPGWYQRAL EEGWDVRVVR MSDREYYQIP QDSLFEHAGE NPELWEALYR FAVAKAQAKP AARKLVEELL RFKPAGAKQE
|
| |