Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1703 |
Symbol | |
ID | 4601665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1643713 |
End bp | 1645086 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774476 |
Product | hypothetical protein |
Protein accession | YP_921101 |
Protein GI | 119720606 |
COG category | [S] Function unknown |
COG ID | [COG4938] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0867999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATGC CTGTTAATGC GCGTGTATAC GTGAGGGACT TCGGTCCTTT CGAGGAGGCG AGTATAGAGG TTAGGCCGCT GACGGTTCTC GTGGGCAGGA ACAGTGTGGG TAAGTCGATG CTATTGCAGC TAGTGTGGGC GCTGACAGTA GCGATGCCCG ACTTAAAGTT GCTCGGCGAG GCTGTCTCAG AGCTCGGGGC AGGGGAGCTC GTAGGCGAGG TTTTGGAGGG TGTCGAGAAA GGGTCTGCGT CCCGGGATAG CTTCAAGAGG CTCCTCAAGC TATACCTGGA GGCGCTTCCC GGGGCCCTCG CTACGGGCCT CGGCAGGACG CTCGAGGAAG TGTTCGGCGC TGGTCTGCAG GAGCTCGTGC GGAGCGGCTC GGGGCGGGCG GTTGTGGGTG TTGCGGGCTC GTTTTCCTCC ATCGAGTTCG TGATTGAGGG TGGGCGCCTC GCAGTGAGCC GGCACAAACC CTACGAGGGC TTCCTGGACG AGCTAGAGGT CACTGTGCCG GCGCCTGGGA GGCTTAGGAT TATTCATAGG CTCTCCGGCG TCAGTCTGTA CGACGAAGCG GTTTTGAGTC CGTCCGACTT GGTTGACGCG GTGCTCAAGG TGTTAGCGAT TTATTTGTAC AAGGCTCTCG ACATCTTCTT CGAGGCTCCC GGCGTTTCCG TGCTTCTGCC TGACAGCAGG GCTGGGCTTT CAAGGATCCT CCTTAAGCCC TACGCTAGGC CTAGCTTGCT TAAGGACGTG TTGTACCCCG ACGAGCACTT CAGGGACGCC TACTTCATGC TGGCCGAGAG CCTGGCCGAG GGGAAGGTCG ACACGGGAGA CTTGGAGGAC TTTCTGAGAG AGCTCGGTTG TAGCGTGGAG GCAATCCCGG AGGGCGGGGT GCGCGCAGTA TACGTCAACA CGTGGAGTGG CCAGAGGCTT CCCCTGCCCC GCGCCCCCTC GGGTGTGCGC GAGTCGCTAG CTGTAGCGCT GGCCCTCGTG GTTCCAGAGC AACCATGGCT AGTGTTTATC GAGGAGCCCG AGGCCCACCT GCATCCTCGG GCGCAGAAGG CTTTAGCGAG GCTTATCGCT AGGGCTGTCA AGAAGCACGG GAAGGTGGTG GTCCTCTCGA CGCACAGCGA TTACCTGCTC TACGCGGTTA GCAACATGGT GGCGTTGTCC TCGTCGCCGG GTGTGGCGGA GAGGCTGGGG TACAGCGCGG CCGAGGTTTT GGATCCAGGG CTCGTGGCGG CCTACTTGCT CAGGGCTGAG GGGAGGAGGG CTGTTGTCGA GAGGTTGGAT GTGGGGCCGG AGGGTGTGCC TGAGGAGGAG TTCGTGAGGG TCGCCGAGGA GCTGGCGGAG GAGAGGGCGA GGATCCTGGC TTAG
|
Protein sequence | MGMPVNARVY VRDFGPFEEA SIEVRPLTVL VGRNSVGKSM LLQLVWALTV AMPDLKLLGE AVSELGAGEL VGEVLEGVEK GSASRDSFKR LLKLYLEALP GALATGLGRT LEEVFGAGLQ ELVRSGSGRA VVGVAGSFSS IEFVIEGGRL AVSRHKPYEG FLDELEVTVP APGRLRIIHR LSGVSLYDEA VLSPSDLVDA VLKVLAIYLY KALDIFFEAP GVSVLLPDSR AGLSRILLKP YARPSLLKDV LYPDEHFRDA YFMLAESLAE GKVDTGDLED FLRELGCSVE AIPEGGVRAV YVNTWSGQRL PLPRAPSGVR ESLAVALALV VPEQPWLVFI EEPEAHLHPR AQKALARLIA RAVKKHGKVV VLSTHSDYLL YAVSNMVALS SSPGVAERLG YSAAEVLDPG LVAAYLLRAE GRRAVVERLD VGPEGVPEEE FVRVAEELAE ERARILA
|
| |