Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1402 |
Symbol | |
ID | 4601816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1354892 |
End bp | 1356460 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774177 |
Product | hypothetical protein |
Protein accession | YP_920802 |
Protein GI | 119720307 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCCAA TAAAGGCGGA ACGTATGAAG ACTGGACCGG TCTTGCTCTT GATAGCGGTA ACCGTAGCGG CAACACTCGT GGGCGCGGGC CCGGTGACCG TCAGCCCCTC GGGGAGCGCC AGGGCCTACG CGTTCGTACT GGAGGATGTA AGCGTTCTGG ACAGCGGCGT CCCCTCAGGG GTGGTGCAGG TCTCGGTGAG CTACTACGGG GCGTACACGT TGCTCGGGGC AAGCCTTGGG CTAACGGCTC GCTGCGGCGG CGCGGAGGTC TCCGCGGGTA GCGTGGACGT CGGCTCCTGG CGCCCCGGGA CCGTCAAGAC GGCGAGGTTC ACGCTTAACA CGTCGAGCCT CGGGTCGGAG TGCACCCTGA GGGTCTCGGT GTCCTGGGGC GACTCGTGGG ACGACGCCCA GAAGACTTAC ACGGGGCTGG GCGGGTCGAC GAGCCTCGAG TACAGCTTTA CGGCGTGCTG GGGCGAGAGG GTCTCGGTGG GCGTGAGGCC GCAGATGGTG TACTCTTCAA CCGTTAACCC CGTGCTCCTA GTCGTGGAGA ACTCCGGGCG GACCGCGCTC AGGCAGGTAG AGGTGTACGT CGCGCCTCAA GGCTCAGTCC TGCTCAACGC CTCCGTGCCT ACCGTTTTCG AGCTGGGAGA CCTGAAGCCC GGGGAGAGGA GGGTAGTGCC GCTCAGCGTG GTCCCGCAGT CCCCCTTCCC CTCGTTCTCC GTCACCGTGA GCTACCTCGA CTGCTCCGGG TCCAAGAAGA GCGTGGCGCA ACAGGTATAC CTCTACGCGG CGGCGGGGCA GAGCATAGTC GTCGTGCCGG ACCCGCCGGT TCTCGTAGCC GGGCAGGCGT CCAACGTCTC GCTCAGGGTG GTCAACGCCG GCGGGGTCGC CGTTAAGGGG CTTAGCCTCG TGCTGGGGGT CCAGAAGAGC CCCCTGAGCG TCTCCCCGAG CTTCCTAGTA GTCGGCGACC TGGGGCCGGG CGAGTCGAGG AGCGTCCCTG TAACCGTGCT CGTACCGGCG ACGGCTTCTA GCAGCGAGTC CGTCGCCTAC CAGGCTCTCT ACAGCGTGGA GGGAGGCGGG CTGGCGACTA GCGGAGGGTC GTTCACGTTC TACGTCGCCC AGAGGTCCTC CGTGTCCATA ACCTCGGTGG ACGTCGTGCC GCAGAGCCCC GAGGTCGGGT CCAACGTCAT ATTCGCGGTG AGCCTGGTGG ATGACGGCAC GTTCCCCGTC TACGCGGTCA ACGTCTCTGC CTACGCGTCC AGGGGCCTCT CCCCCCTGCG CTCGACCTAC GCGTACCTGG GCCAGCTCAA CCCCCAGGTC CTCACGACGG TCCCGTTCAG CTTCAGGGCA GTCGAGGAGG GGATGCAGGA GGTCAGGTTC GTCGTGACGT ACAGGGACGC GTACGGGTAT TCGAGGAGCG CCGAGAGGAC GGTCTACGTC AACGTGGCGA GGCAGCAGCC CTCGCGCCAG GCGCAGGGCG GGTCCGCGAA CCCGTACGTC TACCTCGCCG CCGTTGCGGT AGCCCTGCTG CTGGCCGCCG CGTACGCCGC GAGGAAGAGG AGGGGGTAG
|
Protein sequence | MPPIKAERMK TGPVLLLIAV TVAATLVGAG PVTVSPSGSA RAYAFVLEDV SVLDSGVPSG VVQVSVSYYG AYTLLGASLG LTARCGGAEV SAGSVDVGSW RPGTVKTARF TLNTSSLGSE CTLRVSVSWG DSWDDAQKTY TGLGGSTSLE YSFTACWGER VSVGVRPQMV YSSTVNPVLL VVENSGRTAL RQVEVYVAPQ GSVLLNASVP TVFELGDLKP GERRVVPLSV VPQSPFPSFS VTVSYLDCSG SKKSVAQQVY LYAAAGQSIV VVPDPPVLVA GQASNVSLRV VNAGGVAVKG LSLVLGVQKS PLSVSPSFLV VGDLGPGESR SVPVTVLVPA TASSSESVAY QALYSVEGGG LATSGGSFTF YVAQRSSVSI TSVDVVPQSP EVGSNVIFAV SLVDDGTFPV YAVNVSAYAS RGLSPLRSTY AYLGQLNPQV LTTVPFSFRA VEEGMQEVRF VVTYRDAYGY SRSAERTVYV NVARQQPSRQ AQGGSANPYV YLAAVAVALL LAAAYAARKR RG
|
| |