Gene Information |
Locus tag | Tpen_0604 |
Symbol | |
ID | 4601224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 558721 |
End bp | 560058 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639773379 |
Product | hypothetical protein |
Protein accession | YP_920012 |
Protein GI | 119719517 |
COG category | [R] General function prediction only |
COG ID | [COG1571] Predicted DNA-binding protein containing a Zn-ribbon domain |
TIGRFAM ID | |
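The coordinate and length fields in the table above are tied together by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminating stop codon; both assumptions are consistent with the values listed here but are not stated in the record. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length = gene_length_bp(558721, 560058)
print(length, protein_length_aa(length))  # 1338 445
```

Both results match the Gene Length and Protein Length rows.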
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTTA CGAGCGTCCA CGTAGGGCTC GACGACACGG ACTCTCCGCT GGGGCTTTGT ACAACGTTTG TAGCGCTCGC CATAGTAAGG GAAGCCTCTA GGATCGGTGC CGGCTTCGCT GACCTGCCGT ACCTGGTTAG GCTGAACCCT AACGTACCCT TGAAGACCAG GGGTAACGGT GCAGTAGCCA TTCACTTCCT AGTGGAAGAA GAATTGGTGC CCAGGGTTGA AGAGGCCGTA GTGCGCGCTC TCGACGATCT TTCAGAGAGG CATGGCAAAA CAGATCCTGC CGCTATCCTG GTGAAGGGGG ATGTGCCTCG CGCATTTAGA TCAGTGTACC TTAGGGCTTT GACGGAGTTC ATCCCATCCA GCTACGTTAG AAGTGTCCTG GAACGCTTCA AAGGTGAAAG CGTGAAGTTG CTGTACTCTC GTAGCAAGCG ACCTAGGGGG CTCGTGGGAG CTGTTGCATC GCTGGGGGCT TACCCGCTAG AAGATTACAC CTACGAGCTA ATAGTATACA GGAGCCCGGA GGAGCGTAGC GAGTCGAGAG ACATCTCGGA GGATCTACTA CTGGAGCTTG ACAGAAAGTA CCGCCCGCTG GTATTCGCGA CGTACGACTA CTCTTCTCGC AGAGTCCTAG CCGTACCACA CGGACCGGAC CCCGTAATCT TCGGTCTGAG AAGCCTCGAC CCAGAGATAC TCGTAAACGT GGCGGTAGAT GTGCTCGAAA AGATATCCCA CGCGGGCTAT CTCTTGTTTA AAACTAACCA GGGCACCAGC GCCCATCTGC AGAGGTATAA ACCGGTAGCC CTCGTAAGGC CGTATGACTC CGTCGTCGTG AGGGGCTCTG TCGCCGAGAA ACCTACCGTG ATTAGCGGGG GGCATGTCAT TGTGAATGTT TGCGACGAAA CGGGTTGCCT ACAGATAGCG TTCTACAAGG AGACCGGGAG GCTGAACAGA GTGGCAAAGC TACTATCCAG GGGGGACTTG GTAGAGGTGG GTGGAGGCGT CATGAAAAAA GATGGCCTGG TGCTGAACGC CGAGTACCTC AGGTTGATCA AGCCCGCTTT AAACGTAAAG CGGCTGAACC CCCTCTGCCC CAAGTGTGGC TCGAGAATGA CTTCGGCCGG AAAGGGTAAG GGCTACAAGT GCCCTAAGTG CGGCTACAGG GCCAAAGAAG CCGCAAAGAT ATACCGAGTC GCTCCGAGGA CGCTTGAGCC GGGAACGTAC TTCCAATCTC CCTCTGCCTA CAGACACTTA ACGAAACCTC CCGAGATTCT GGGACTACGG CCGGTAGATG CAACCAAAGT GTTGCTCAGC GGAATGTGGT TCATGTAG
|
Protein sequence | MGLTSVHVGL DDTDSPLGLC TTFVALAIVR EASRIGAGFA DLPYLVRLNP NVPLKTRGNG AVAIHFLVEE ELVPRVEEAV VRALDDLSER HGKTDPAAIL VKGDVPRAFR SVYLRALTEF IPSSYVRSVL ERFKGESVKL LYSRSKRPRG LVGAVASLGA YPLEDYTYEL IVYRSPEERS ESRDISEDLL LELDRKYRPL VFATYDYSSR RVLAVPHGPD PVIFGLRSLD PEILVNVAVD VLEKISHAGY LLFKTNQGTS AHLQRYKPVA LVRPYDSVVV RGSVAEKPTV ISGGHVIVNV CDETGCLQIA FYKETGRLNR VAKLLSRGDL VEVGGGVMKK DGLVLNAEYL RLIKPALNVK RLNPLCPKCG SRMTSAGKGK GYKCPKCGYR AKEAAKIYRV APRTLEPGTY FQSPSAYRHL TKPPEILGLR PVDATKVLLS GMWFM
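The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how the two relate: strip the whitespace, translate codon by codon, and stop at the first stop codon. The codon table below is the standard amino acid assignment, which translation table 11 shares for elongation (table 11 differs in its permitted start codons). The helper names are hypothetical, and `gc_percent` is one plausible way to compute a GC figure; the record does not say how its own value was derived or rounded.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence from the record.
print(translate("ATGGGACTTA CGAGCGTCCA"))  # MGLTSV
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence once its block spacing is removed with `clean`.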
|
| |