Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1526 |
Symbol | |
ID | 4600663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1472484 |
End bp | 1474061 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774300 |
Product | hypothetical protein |
Protein accession | YP_920925 |
Protein GI | 119720430 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAA CCGTTCTAGC CCTCTCGATA GCCATGCTCG CGCTGGCAGG CGCCTACGCC GCCGTCACGA TAATCAACCT GACGGAGTGG CACGTCCAGG CGACCCAGCC CCCCGTCAAG AAGCTAGCCG CTTACGGAGT CAAGAGAACC GCCGCGTACG CGCAAGCGGA GAACGGCTTA AACGTGACGT ACGTCGAGGT GACTGCGCCG CTCGGTTGCA GGGTCATTTT CGACCCAGCA CTAGTCCTGT CCGCTTCAAC CAATCTACCC GCGAAACTAT ATGCGGAGTC CGTAACCGGT AACTACGCGT TTCAGCTATC GGTAAACGTA AGCCTCGGCG GCACGCAACA AGTACAGGTA GTAAGCGGGT CGATAACGCA GAGTGCCGGC GCGCCTGTAA GCGTGCAGGG AAGCGCGAAC TTCAGGCTAG AAGTCCTCGT GGGCTCTGGA GCACCTCTGG GCTCGGAAGT CGCCAGGCTC TACACGTGGC TTTACTTCAA TGCAAGCACG TCCAAGGCTA GGCAGAAGAT AGTCTGGGTA TTCAAAACAA TGAGCTGGCT CTACTGCGGC TGGCTGTATA GGAAGGGCCA CGACATTGTT GGCTCTACTG CGGGGGCTGT TTCCGGCTAC CAGGTGAGGC TCAGGGTGTG TAGTGGTAGC GGGACTGACT CGGGGGATAC TGTCTACTTG AACGGGAAGG GGAGGGCGGA CTTCGGGGAC CTCAGGTTCT CGTACCTCCA GCCCGACGGG AGAGAGGTAG CTATACCGTA CTGGGTGGAG AGCGTTAGCG GCGGCTGCGC CACCGTATGG GTGAAGGTGC CGAGCATACC TGCCAGCCCC GGGACAGCCA GGATATACGT CTACTACGGA AACCCCTCTG CTACTAGCGA GAGCAACGGA GCGGCGACGT TCATCTACTT CAACGACTTC TCGAGCGCCA TAGGTTTCAC GAGCTTCCCA TACGACAAGG GGGCGGACGG CTGGGGCGAC GTGGACGGTA GCGACTCTTC TGGCTGGCAG CTCTTCCAGG GGGCTCTCAA GCACCGCGTG ACTGTTGACG CTAGCCACAT GTACGCGTTG ACGGACGTCC AGAATGGTAA CGTGGCTGTA TGTGTCCTCA TAAACCTCGG GAAGCTCGAG GAGATAGAGG AGGCGGGAGT AGCGGCTAGG CACAGTGTCG ACGCCAACGG CTACATAAGG CAGTACTACG CTAGGCTCAT ATACGCGCCG AGCCTGATCG ACTCGTTAAC GTTTAGGCCA AGAGTTAACG TGGAGCTCGT TAAGAGTACT GGTCTGCCGA GCGAGGTAGA GCCCGTGCAC GTCTATTACA GCTTGCAGGC TAATACTTGG TACAAGCTTG AGATGAGGCT GTACGGTGGG CACATAACAG TGTACGTGGA CGACTCTAAG GTTATCGACT GGACCGACTC AAGCCCGATA ACCTCCGGCT CCAGGCTAGG CCTCTTCAGC GCGTACAACA AGAACGTCGA CCACCTATAT GACAACTTCT ACGTCCGCAA CTACGTAGAG CCAGAGCCAA GCCACGGGCA GTGGCTCCCG GAGGAGACGG ATCCTTAA
|
Protein sequence | MRKTVLALSI AMLALAGAYA AVTIINLTEW HVQATQPPVK KLAAYGVKRT AAYAQAENGL NVTYVEVTAP LGCRVIFDPA LVLSASTNLP AKLYAESVTG NYAFQLSVNV SLGGTQQVQV VSGSITQSAG APVSVQGSAN FRLEVLVGSG APLGSEVARL YTWLYFNAST SKARQKIVWV FKTMSWLYCG WLYRKGHDIV GSTAGAVSGY QVRLRVCSGS GTDSGDTVYL NGKGRADFGD LRFSYLQPDG REVAIPYWVE SVSGGCATVW VKVPSIPASP GTARIYVYYG NPSATSESNG AATFIYFNDF SSAIGFTSFP YDKGADGWGD VDGSDSSGWQ LFQGALKHRV TVDASHMYAL TDVQNGNVAV CVLINLGKLE EIEEAGVAAR HSVDANGYIR QYYARLIYAP SLIDSLTFRP RVNVELVKST GLPSEVEPVH VYYSLQANTW YKLEMRLYGG HITVYVDDSK VIDWTDSSPI TSGSRLGLFS AYNKNVDHLY DNFYVRNYVE PEPSHGQWLP EETDP
|
| |