Gene Information |
Locus tag | Tpen_1572 |
Symbol | |
ID | 4600558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1521708 |
End bp | 1523303 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774345 |
Product | hypothetical protein |
Protein accession | YP_920970 |
Protein GI | 119720475 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.127794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTATTCTT CCCTTAGGCA GGTCGGCAGG ATAGTCGAGG AGGCTACTCC GGAGAGCTTC ATCTTCGTCA CGACGAAGCA GGACCACCCG CCGAAGTACG AGTACGTGCT GGTTAAGTCG AGGGAGGTGG TCGCCGGCGA GGAGAGGGAG GTGGACGTCC TCTGCCAGGT GACCGGCGTG GTTTCCAGGA GCGACGCGTA CAGCAGTAGG CTAGACCTGG AGAGCCTAGA GCGGATACAC GAGGCTGGGA TAGACGACGC CAACCTTCTC TGCGCGGCGC GGACCCTCGG CTACCTCGCC GAGGAGGACG GGAGGAAGGT GGTCCTGATG CCCAGGAGGG CTTTCTTCCC GGGAAACCCG GTCTACCTGG CCCCGGACGA CTTCGTCAGG GAGTTTTTCT CGTACCCCGG GGAGGAAGGT ATACGCATAG GTAGCCTCGT CTCCCGCAGG AACGTCGACG TCTACCTCTC GGTGAACGGG TTCAGGAGGC ACGTCGCCGT TATAGCCCAG ACCGGCGCGG GCAAATCGTA CACCGTCGGC GTCATACTGG AGGAGTTGCT GAGGCTGGGA GCCACGGCGG TAGTCATAGA CCCGCACGCG GACTACGTGT TCCTGAGCAG GGACAGGGAC ATGAGGCGCC ACGAGTACTC GGACAGGGTC CTCGTCTTCA GGAACCCGAA CAGCACTGGG CGCTACGACC CGAGCCAGAT GGACAACGTG CACGAGCTGA CGGTGAAGTT CTCCGATCTC TCGGCGGAGG ACGTGGCGAG GATAGCCGGG ATTCCGGAGA AGTGGACGAA CGTAAGGAAG GCTATCCGCG ACGCCCTCGA CAAGCTCAGG GGGAGGGACT ACACGATTGA CGACCTGCTC GGCGAGCTGG AGAAGATGTC GCGGGGAGGC GGCAAGGAGG CTGCCTACGC GTCGAGCGCG TACAACCACC TCGTGAAGCT GAGGAAGTTC AGCGTGTTCG GCAGGTACAC CACGTCCGTC AAGGACGAGA TCCTGAAGCC GGGGCACGTG AGCGTGCTGG ATCTGTCCGG GCTTAACGAC GCGAGCCAGG ACTACATAGT GAGCACCGTG CTAGAGGAGA TATACAGGCT GAGGTACTCG GGGGAGTTCA GGTACCCCGT CTTCGTGGTG GTAGAGGAGG CTCATAGGTT CGTCCCATCC AAGGCCTCGA AGAGGTCCAC CATGTCCTCC GAGATCATAA ACACGATAGC GGCGGAGGGC AGGAAGTTCG GCGTGTTCCT GATCCTTGTG ACCCAGAGGC CGAGCAAGAT AGACGCGGAC GCCCTGAGCC AGTGCAACAG CCAGATAATA CTGAGGATCA CTAACCCGAG CGACCAGAGG GCTGTCGCCG AGGCTAGCGA GAGGCTGGGG GAGGACCTCA TGAGGGACCT ACCTGGGCTC AACGTCGGCG AGGCCATAAT AGTTGGGGAG CTGACCCGGG TACCCGTCGT GGTGAAGGTG AGGCGCAGGT TTACGCGGGA GGGGGGCGCC GACATAGACC TCGTCGCAGA GCTCCGGAGA GCCCGCGAGG AGCTCAGCCT CGCGTCAAGG ACATATAGGG GCGAGGACCT ACTATCCGAG GTGTAG
|
Protein sequence | MYSSLRQVGR IVEEATPESF IFVTTKQDHP PKYEYVLVKS REVVAGEERE VDVLCQVTGV VSRSDAYSSR LDLESLERIH EAGIDDANLL CAARTLGYLA EEDGRKVVLM PRRAFFPGNP VYLAPDDFVR EFFSYPGEEG IRIGSLVSRR NVDVYLSVNG FRRHVAVIAQ TGAGKSYTVG VILEELLRLG ATAVVIDPHA DYVFLSRDRD MRRHEYSDRV LVFRNPNSTG RYDPSQMDNV HELTVKFSDL SAEDVARIAG IPEKWTNVRK AIRDALDKLR GRDYTIDDLL GELEKMSRGG GKEAAYASSA YNHLVKLRKF SVFGRYTTSV KDEILKPGHV SVLDLSGLND ASQDYIVSTV LEEIYRLRYS GEFRYPVFVV VEEAHRFVPS KASKRSTMSS EIINTIAAEG RKFGVFLILV TQRPSKIDAD ALSQCNSQII LRITNPSDQR AVAEASERLG EDLMRDLPGL NVGEAIIVGE LTRVPVVVKV RRRFTREGGA DIDLVAELRR AREELSLASR TYRGEDLLSE V
|
| |
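The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table for locus tag Tpen_1572.
START_BP = 1521708
END_BP = 1523303
GENE_LENGTH_BP = 1596
PROTEIN_LENGTH_AA = 531


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS with no frameshift, so the gene length
    must be a whole number of codons.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed Gene Length and Protein Length agree with the coordinates, which is consistent with the gene sequence ending in the stop codon TAG.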