Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1552 |
Symbol | |
ID | 4600901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1501021 |
End bp | 1503177 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774326 |
Product | hypothetical protein |
Protein accession | YP_920951 |
Protein GI | 119720456 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.948257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGGA TACTGCAGTT CAACTTCGAG GATAAGCTAG CCCAGTACGC AGACCGCGTT ACCGGCGCGG ACATAGTGGG GCTCGCGGCT GAGCTCCACT GCGACACTGT AGTCATCTTC GCGAGGGACG CCTGGGGGAG GGCGTACTAC GACAGCGCCG TTGCAAGGAA GGTAGCAAGC CTGAAGTCCA GAGACTTGCT ACGCGAAGTA GTGGAGGAGG CTCACAGGCG CGGAATAAAG GTCGTGGCGA TGATAGGGCA TACGACGAAC CCGGAGCTGT ACAGCTCCCA CCCCGAGTGG GCTCAGCGCG ACAGGAACGG GAGGGTAATA CACATGGACA CGGATCCCCA GGGCGTTAAG GACAAGGTTA GGTGGCCGCT CATGTGCCTG AACTCCCCGT TCCTCGACTA CGTGCTCAGG GAGGCGGAGG AGGTCCTGCG CTATGGTGTC GACGGCGTGT TCCTCGACTC GTTCAGGTAC ATGCCCGACG TGGAGAGAGC CTGCTTCTGC GAGAACTGCA GGAAGGCTTA CGCGGAGGAG GTCGGAGGAG AGCTACCGTC GGAGGAAGAC TGGGACAGCG AGGCCTTCAG GAGGGCCTTC GCGTGGAGGT ACAGGGTGAA CGTGAAGGCT CTCGAGAGAG TTAAGGACTT CGTGAGAAAG GCGAAGCCGG GAGCCTTCCT CGTGTACAAC AGCCACCCAG CCGGCTGGAG GGGCAGGGCG AACACGATAG TCGAGATGTC CAGGAACGCC GTCGACGTCG TCTTCGCCGA GGGCTCCGAG GCTGACTACC AGCCGCCGGG CTTCCTCGCC GAGATAGTCA AGCTGTCCAA GGCTATGGGT GCAAAGAGGG TTTGGGCAAC CCGCAACTCG TTCCACATGG CCCTCACAAC AACCACGACG AGCCCCGTAG TAGTAAGGCA GGGGATACGC GAGATATTCG CGGCGGGAGG GGAGCCCATG CTCCTGGTAT TCAGCTCCGC CTTCGTTCAG TCCCCGAAGG GGCTGAAGGC AGCCGCCCAG GCTTTCAGAG AAGTAGAGGC GCTAGAAGAG TACATGGAGG GGGCCGAGCG GCTACGCTAC GCGGGCGTAG TCTACTCGAA TAGGAGCAGG GACTGGCTCG GGCGGAGCGA CCCGAGGCAC GTGACGGACG AGGCGAGGGG GCTCTACTAC GCGCTGGCGT ACAGCGGGTA CCCTGTCGAC TTCGTATCCG ACACCCAGCT GGACTCGGGA GAGCTGAAGG GCTACAGAGT GCTCCTCCTG GGCAGTGTCG CCAGTATGTC GAGAAGGGGG GTAGCGTCGC TCGCAGAGCG CGCCGCGAGA GGCCTTGGAG TCGTAGCCAC GTACCTGACG TCCACGATGG ACGAGGACGG GCGCCAGCTA GAGGAGTTCC AGCTATCGGA GCTACTCGGA GTCTCGTACA AGGGGGTTCT CGAACTACCG TGGAGCTACG TCCTCCCACA CGGCGAGCAC CCCGTGACGG AGGGACTCCA GGGCGAGGCT ATCCTCTGGG GCGACTACGA TAGGGTGTTC AACGGTAGAA GGGTTCCGCC GAGCATCGCG TGGCACGCTC GCGTGAAGGC ACTCGAAGGA ACCTCGGTGC TGGGATACGT GGGCGAGCCC GCCGGGGAGT ACGGCTACGA GTACGAGAAC GGGAGGTCTC CGCCCCTCCT GGGGTCGCCT ACAGGGGCAC CGGCCATCAC CGCGAGGGAG GAACCTCGCG TCGTGTACTT CTCCGGACAA CTCGGCAGGC TGTTCTGGAG GACCGGCCTC CCCCAGCACG AAGCGCTGAT ACTCAACGCC GCCAGGTGGG CCGGCGGAGA GCCCCCGCTG AAGCTGGAGA GCGAGGGACT CGTTCTCGTC GAGCCTTACA CGAGAAGCGG TCAGCTAGTC GTGCACCTCG TGAACCTAAC GTACGACAGG AGGATCATTG TACGCGGAAA CACCGCGGAT CCCGATGCCT GGCACTCGAC GAGCGAGAGC GTGATGCCGC CTAGGCGCGT AGTACCCGTT GAAGCACACC TACGCCTAAG AGGCTTCGCC CCGAAGAAAG CCTACTCCCC CCTAACCTCG AAGAAGTACG AGGTAGAGAC TAAGGGCGAG GAGGTTGCTA TAAGAGTACC CCTCGAGGAG TACGAGGTAC TGGTGCTCGA CCTCTAA
|
Protein sequence | MARILQFNFE DKLAQYADRV TGADIVGLAA ELHCDTVVIF ARDAWGRAYY DSAVARKVAS LKSRDLLREV VEEAHRRGIK VVAMIGHTTN PELYSSHPEW AQRDRNGRVI HMDTDPQGVK DKVRWPLMCL NSPFLDYVLR EAEEVLRYGV DGVFLDSFRY MPDVERACFC ENCRKAYAEE VGGELPSEED WDSEAFRRAF AWRYRVNVKA LERVKDFVRK AKPGAFLVYN SHPAGWRGRA NTIVEMSRNA VDVVFAEGSE ADYQPPGFLA EIVKLSKAMG AKRVWATRNS FHMALTTTTT SPVVVRQGIR EIFAAGGEPM LLVFSSAFVQ SPKGLKAAAQ AFREVEALEE YMEGAERLRY AGVVYSNRSR DWLGRSDPRH VTDEARGLYY ALAYSGYPVD FVSDTQLDSG ELKGYRVLLL GSVASMSRRG VASLAERAAR GLGVVATYLT STMDEDGRQL EEFQLSELLG VSYKGVLELP WSYVLPHGEH PVTEGLQGEA ILWGDYDRVF NGRRVPPSIA WHARVKALEG TSVLGYVGEP AGEYGYEYEN GRSPPLLGSP TGAPAITARE EPRVVYFSGQ LGRLFWRTGL PQHEALILNA ARWAGGEPPL KLESEGLVLV EPYTRSGQLV VHLVNLTYDR RIIVRGNTAD PDAWHSTSES VMPPRRVVPV EAHLRLRGFA PKKAYSPLTS KKYEVETKGE EVAIRVPLEE YEVLVLDL
|
| |