Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0527 |
Symbol | |
ID | 4601345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 478244 |
End bp | 480598 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639773297 |
Product | SMC domain-containing protein |
Protein accession | YP_919936 |
Protein GI | 119719441 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGGTTG AGGGTTACAG GTTCTTTAAA GATCCTTTTC GCGTCGAGTT TGGAGACGGA GTAACAGTCA TTGCCGGGCC TGTTGGCTCG GGTAAGAGTA GCCTTTTATC GGCAGTAGAG TATGCCCTGT ACGGGACCGA CTCCTACATC GAGAGAAGGG TCTACACGAA GAAGGACCTA GTTAACTCGG CGTCAGCGTC TCTCAGGGTT CTACTCGAAT TAGAAGTGGA GGGTGAAGGA GTTTACAGGA TCGAGAGGAG ACTTTCGAAG GAGGGCAGGG AGAGAGTGGA GGTTCTGCTC CCGGACGGGG GTATTCTGAG AGAGTCTTCA AAGGTACTGG AGAAAGTAGA GGAGCTTCTG GGGATGGACT TCTTCGAGTT TTCCCGCAGT GTTTCGATAA ACTACGTAAC GCTGTTTTTA CTCGCGCACG GGAGTCAAAG CGTTAGAAGC AGGGTTTTAG ACAGGTTGCT CGGCATAGAC GCCGTTGAGA AGCTCGCACG AGCCATAACC GCGAAACCTA TCCTCGACGA AATGAGAAGC CTCAAAGGGG ATGTCGACTT TCTACGCTCA GCGGGTTTCT CGGAGGAGAA CCTGAATATC CTTAGAGAAG AGGAGCGCAG GCTTGAGCAG GAACTAATCG AAGGCGAGAG GCTTCTTGAA GGGCTAAAAA AGGAAGCCTC CGACCTCGAA GGCGCGGCTA AGAAGTACCA GGAGCTCAAA AGCGAGCTGA ACGCGAAGGA AAAGCTACTG GAGGACCTGA GGGACAGGGT TAAAGGAGTT CTTAGCCTGG ACGCCCTGGT GGTGGGGCTC GAAGAGCTGA GAGATAAACT ACTACGCGCT GCAGAAAAGC TCCTGCCACC TTCCAAGCTC ATCAAGGAAA TAGAAGGCAT AGAGGTCTCT GAGTATAACC TTCGAGAGGT TTTCAGCGCT TTTGAGAGGG TTTTCCACCA GCTGGACGAG ATCTACTCCG AGAAGTACAG GGAGATAAAG GGGCTAAGCG CACAGGTAGA AGTCTACGAG AAGCAACTAA GGGAGATCGA AAGCCGGCTC GTAGGCTTAG AGGAGCATGT GAGTGACTAT GAGCGCGCGG AGAGCGAGAT CGAGAAGATA AAGTCGGAGT ACGGGGACGA GAACAAGCTA CGCGAAGAAA TAGGGAGGCT TGAGAGCGAG CTCGGCATGC TTCAAAGAAG GAGCGAGCTC GAGAGGTGTG TTTCCTCGGT GAGGAGGGTC TTAGCGGAGG AGGTCGCGAA GAAGGGCGAG GCGGAGTGCT ACGTATGCGG GAACAGGCTT TCGGAGGAAT TTCTCGACTG GGTGAGGGAG AAGGTCTCTA AATCGGTTAA GGAGCTAAAG GACGTAGAGG AGAGCATAGG TAAGCTTAGG GAGAGAATAA ATGTGCTTAA GAAGAAGCTG GAGGACCTCA GGGAGTACAA GCTTACCCTT ATAAACTACG AGGCGGCGTA CGAGGAGTAC CAAAGGCTAC TAGAGGAAAG GAAGAACCTT CAGGCGGCAC TCGACGCCGA GAGGGAGGAG CTAGAAAGGG CCAAGAGCGG GTTAAGCGTT ATAGGAGCAG AGCTGAAAGT TGTTAGGGAA GATTTCTTAA GGCTTAGAAC CTCGTACTCC AAGTTGCCGT TGCTAGACGA GATAAAGAAA TTGGAACAAG AAGTCTCAAG TCTCAGGTCG GAGCTTCAAA GGCTTGAGCC GCAGTACAAC AAGTACTTGG AGCTTGAGAA AAGAATTGAA GCGTTGTCGC GGGAAATAGA GGACAAGAGG CATAGATTGG AGGGTATCAG GAAAGACATC GATGAAAAGA GAGCCATGCT GGAGAGCTTC GAGGAGCGCT TAGCCAGGGA GAGAAGGCTT GAAGAAGTTC TGGGAAAAGT TCGGAGAGTT AAGGAGGCAT TAATAGAGGT GCACGCGGAT TTGAGGAGCC AGAAGATAAT GGAGCTAAAC AAGGTTGTCA ACGAGATTGT AAGGGGGATT TACCCCTACA CGGATATCGA GGAAGTTAGA GTGAGGGTTG TCTCTCCCTC GCGAAGAGTT GGAGGACGAA GCATTTACCA AGTCGAGGTT AAAGTGGGCG GCGAGTGGTA CCCGTACTCC TCTAGGCTGA GCGACGGTCA AAAAACGGTG GTTTTCCTCT CGTTGCTCAT TGGGCTTAAC AGGTTGCTTA ACAAAAGGGT AGGCTTCCTA GTGCTCGACG AGCCAGTTCC GAACGTTGAT GACGCTATCA AAGCATCCCT GCTTAAGTCA ATGTTGCTGG TAACGGGTCT GAGGCAGGCG ATTGTTACTA CCCAGGCAGA AGACATAGCG GGAAGAGTTG AAGGAGTTTC CCTGGTAAGG CTTTCCCGGA TGTAG
|
Protein sequence | MEVEGYRFFK DPFRVEFGDG VTVIAGPVGS GKSSLLSAVE YALYGTDSYI ERRVYTKKDL VNSASASLRV LLELEVEGEG VYRIERRLSK EGRERVEVLL PDGGILRESS KVLEKVEELL GMDFFEFSRS VSINYVTLFL LAHGSQSVRS RVLDRLLGID AVEKLARAIT AKPILDEMRS LKGDVDFLRS AGFSEENLNI LREEERRLEQ ELIEGERLLE GLKKEASDLE GAAKKYQELK SELNAKEKLL EDLRDRVKGV LSLDALVVGL EELRDKLLRA AEKLLPPSKL IKEIEGIEVS EYNLREVFSA FERVFHQLDE IYSEKYREIK GLSAQVEVYE KQLREIESRL VGLEEHVSDY ERAESEIEKI KSEYGDENKL REEIGRLESE LGMLQRRSEL ERCVSSVRRV LAEEVAKKGE AECYVCGNRL SEEFLDWVRE KVSKSVKELK DVEESIGKLR ERINVLKKKL EDLREYKLTL INYEAAYEEY QRLLEERKNL QAALDAEREE LERAKSGLSV IGAELKVVRE DFLRLRTSYS KLPLLDEIKK LEQEVSSLRS ELQRLEPQYN KYLELEKRIE ALSREIEDKR HRLEGIRKDI DEKRAMLESF EERLARERRL EEVLGKVRRV KEALIEVHAD LRSQKIMELN KVVNEIVRGI YPYTDIEEVR VRVVSPSRRV GGRSIYQVEV KVGGEWYPYS SRLSDGQKTV VFLSLLIGLN RLLNKRVGFL VLDEPVPNVD DAIKASLLKS MLLVTGLRQA IVTTQAEDIA GRVEGVSLVR LSRM
|
| |