Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0964 |
Symbol | |
ID | 4600438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 914482 |
End bp | 916284 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773742 |
Product | hypothetical protein |
Protein accession | YP_920367 |
Protein GI | 119719872 |
COG category | [S] Function unknown |
COG ID | [COG3410] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.624537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTGACC CGGACCCAAG CAAAGCGAGA GACGTCGTCG GGGCGCTAGT CGAGGGCTAC AGAAGGTTCT ACGGCGAAGA CCCCTCCGGC GAGCTCGTAG CGTCGTGGAG TAGCAGCGTT GCCCGCGTCC TAGGCGTCCT GGAGAGGGCG GGAGGCTTCC CCGCGGTGCT CGAGCTACCG CTGTTCGGGT CCGAGAGGGC CGACTTCGTA GTCGTCGGCA GGGGTAGGGC GCTCGTAGTC GAGGCGAAGG GCTGGAGCAC GGTTGAGAAG CTGAACTACG TGGTGCAGGT CGACGGGCTG AGAGAGGTGG ATCCGTGCTA CCAGGTGGAG AACTACGTTT CGAAGCTCAA GTACTTCAGC ACCGCCGCGG ACAGGGTGAG GCACTTCGAC GGGGTAGCGT ACCTCTACGG AGGCGCGAGC TACTCGGATG GCTGCAGGAT CGCGAGGAGC GACGCGGAGC TGGAGGAGTA TGTAGGCTCC CTCGGCTCGC CCGGCGACGA GGGAGACGTC GAGGCGGTAG CAAGCGCCAA GTTCACGGTG AGGAGGGATA TAGTCGAGTT CCTGCGGAGC CACAGAGACA AACTCCTCAA GGAGGCGGCG CGCTTCCTAG CCTCGGAGGG GTACGGGCTG GGCAGGGAGC AGCTAGTCCT GGTGCACGAC GTCCTAGAGG CGCTGGAGGC GGGGTCTAGG AAGGCTTTCT TCGTCAGGGG CGGGACCGGC TCCGGGAAGA CGCTCGTCGC CCTAACCCTC CTCTTCGAAG CGGTTTCGAG GGGCTACCAC GCCGTCCTGG CTTACAAGAA CAACAGGTTG CTCAACACGC TCAGGTACGC CCTCTCGTTG CGCGCACCGC GCGGTGCGCC CAAGCTCAGC GCACTCATAG TGTACTACTC TACCGGCAGA GGGCACGGGC TCGGGGAGAG AAGAGCGTAC GAGAAGGGAC TTTACAGAAA CCTGAATCTC GCGGTGCTCG ACGAAGCGCA GAGGATGACG CTCGAGAACA TCGAGTACAC AATGAAGAGC GCCCCCGTCA CCGTCTACTT CTACGACGAC AAGCAGATAC TCATAGGCTA CGAGGAGGGC TTCAGGGAAA ACTTCCTCGA AGCGGCGGAG AGGCTCGGGC TCGCCTACGA CGAGAGGGAG CTGAAAACGC TCTACAGGGT CCCGCCCGGC TACGTGAAGC TCGTGGAAAG CCTGGTCTAC AGCGGGGCAG TCGCCCAGCA GGACGTCCAG GGCTACGACA TCAAAGTATT CGATAACCCG GCAGACATGC TTGAAGCCCT CCAGGAGAAG GCGAACAAAG GCTTCAAGGT AGCCCTCGTG TGCGCCTTCA CGGAGACGAG GGGCGACAAG AACGACCTGA ACAGCCCGGA GAACAGGAGG CTCACAGTCA AGCGCGGAGA CCGCGAAGAA GTAGTCACGT GGCTCATGGA CGAAAAAGAG GAGTACCCCA AGTATTGGTG CGGAGAGCTG GGAAACCCCC TCACGCGCTG CGCCTCAGTC TACGGGGCAC AAGGCTTCGA GGCAGACTAC GTCGGAGTAG TATGGGGCAG AGACATGGTC TGGAGGTGCG GGCCGCTGGG CTGTGGGTGG AGCGTAAACC CCGACGCCAT AACAGACTAC GTAGGCGGGC AGTACTCGCT GGAGAAACTA GCCAGGAAAG ACCCCGGCAA AGCCCTCGAA CTACTCAAAA ACAGGTACTA CATCATGCTG ACAAGAGGAA TCAAGGGAAC CTACATATAC CCCGAAGACG GGGAAACAGG GCGCCTACTC AGAGAAGTAG TCGAAAAGCT ACAGCAACAC TAA
|
Protein sequence | MVDPDPSKAR DVVGALVEGY RRFYGEDPSG ELVASWSSSV ARVLGVLERA GGFPAVLELP LFGSERADFV VVGRGRALVV EAKGWSTVEK LNYVVQVDGL REVDPCYQVE NYVSKLKYFS TAADRVRHFD GVAYLYGGAS YSDGCRIARS DAELEEYVGS LGSPGDEGDV EAVASAKFTV RRDIVEFLRS HRDKLLKEAA RFLASEGYGL GREQLVLVHD VLEALEAGSR KAFFVRGGTG SGKTLVALTL LFEAVSRGYH AVLAYKNNRL LNTLRYALSL RAPRGAPKLS ALIVYYSTGR GHGLGERRAY EKGLYRNLNL AVLDEAQRMT LENIEYTMKS APVTVYFYDD KQILIGYEEG FRENFLEAAE RLGLAYDERE LKTLYRVPPG YVKLVESLVY SGAVAQQDVQ GYDIKVFDNP ADMLEALQEK ANKGFKVALV CAFTETRGDK NDLNSPENRR LTVKRGDREE VVTWLMDEKE EYPKYWCGEL GNPLTRCASV YGAQGFEADY VGVVWGRDMV WRCGPLGCGW SVNPDAITDY VGGQYSLEKL ARKDPGKALE LLKNRYYIML TRGIKGTYIY PEDGETGRLL REVVEKLQQH
|
| |