Gene Tpen_0527 details

Gene Information

Locus tag: Tpen_0527
Symbol:
ID: 4601345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 478244
End bp: 480598
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 51%
IMG OID: 639773297
Product: SMC domain-containing protein
Protein accession: YP_919936
Protein GI: 119719441
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID:
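
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length counts the terminal stop codon while the protein length does not. The function name `expected_lengths` is illustrative and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start and end values taken from the record above.
print(expected_lengths(478244, 480598))  # (2355, 784)
```

The result matches the reported Gene Length (2355 bp) and Protein Length (784 aa).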


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGGTTG AGGGTTACAG GTTCTTTAAA GATCCTTTTC GCGTCGAGTT TGGAGACGGA 
GTAACAGTCA TTGCCGGGCC TGTTGGCTCG GGTAAGAGTA GCCTTTTATC GGCAGTAGAG
TATGCCCTGT ACGGGACCGA CTCCTACATC GAGAGAAGGG TCTACACGAA GAAGGACCTA
GTTAACTCGG CGTCAGCGTC TCTCAGGGTT CTACTCGAAT TAGAAGTGGA GGGTGAAGGA
GTTTACAGGA TCGAGAGGAG ACTTTCGAAG GAGGGCAGGG AGAGAGTGGA GGTTCTGCTC
CCGGACGGGG GTATTCTGAG AGAGTCTTCA AAGGTACTGG AGAAAGTAGA GGAGCTTCTG
GGGATGGACT TCTTCGAGTT TTCCCGCAGT GTTTCGATAA ACTACGTAAC GCTGTTTTTA
CTCGCGCACG GGAGTCAAAG CGTTAGAAGC AGGGTTTTAG ACAGGTTGCT CGGCATAGAC
GCCGTTGAGA AGCTCGCACG AGCCATAACC GCGAAACCTA TCCTCGACGA AATGAGAAGC
CTCAAAGGGG ATGTCGACTT TCTACGCTCA GCGGGTTTCT CGGAGGAGAA CCTGAATATC
CTTAGAGAAG AGGAGCGCAG GCTTGAGCAG GAACTAATCG AAGGCGAGAG GCTTCTTGAA
GGGCTAAAAA AGGAAGCCTC CGACCTCGAA GGCGCGGCTA AGAAGTACCA GGAGCTCAAA
AGCGAGCTGA ACGCGAAGGA AAAGCTACTG GAGGACCTGA GGGACAGGGT TAAAGGAGTT
CTTAGCCTGG ACGCCCTGGT GGTGGGGCTC GAAGAGCTGA GAGATAAACT ACTACGCGCT
GCAGAAAAGC TCCTGCCACC TTCCAAGCTC ATCAAGGAAA TAGAAGGCAT AGAGGTCTCT
GAGTATAACC TTCGAGAGGT TTTCAGCGCT TTTGAGAGGG TTTTCCACCA GCTGGACGAG
ATCTACTCCG AGAAGTACAG GGAGATAAAG GGGCTAAGCG CACAGGTAGA AGTCTACGAG
AAGCAACTAA GGGAGATCGA AAGCCGGCTC GTAGGCTTAG AGGAGCATGT GAGTGACTAT
GAGCGCGCGG AGAGCGAGAT CGAGAAGATA AAGTCGGAGT ACGGGGACGA GAACAAGCTA
CGCGAAGAAA TAGGGAGGCT TGAGAGCGAG CTCGGCATGC TTCAAAGAAG GAGCGAGCTC
GAGAGGTGTG TTTCCTCGGT GAGGAGGGTC TTAGCGGAGG AGGTCGCGAA GAAGGGCGAG
GCGGAGTGCT ACGTATGCGG GAACAGGCTT TCGGAGGAAT TTCTCGACTG GGTGAGGGAG
AAGGTCTCTA AATCGGTTAA GGAGCTAAAG GACGTAGAGG AGAGCATAGG TAAGCTTAGG
GAGAGAATAA ATGTGCTTAA GAAGAAGCTG GAGGACCTCA GGGAGTACAA GCTTACCCTT
ATAAACTACG AGGCGGCGTA CGAGGAGTAC CAAAGGCTAC TAGAGGAAAG GAAGAACCTT
CAGGCGGCAC TCGACGCCGA GAGGGAGGAG CTAGAAAGGG CCAAGAGCGG GTTAAGCGTT
ATAGGAGCAG AGCTGAAAGT TGTTAGGGAA GATTTCTTAA GGCTTAGAAC CTCGTACTCC
AAGTTGCCGT TGCTAGACGA GATAAAGAAA TTGGAACAAG AAGTCTCAAG TCTCAGGTCG
GAGCTTCAAA GGCTTGAGCC GCAGTACAAC AAGTACTTGG AGCTTGAGAA AAGAATTGAA
GCGTTGTCGC GGGAAATAGA GGACAAGAGG CATAGATTGG AGGGTATCAG GAAAGACATC
GATGAAAAGA GAGCCATGCT GGAGAGCTTC GAGGAGCGCT TAGCCAGGGA GAGAAGGCTT
GAAGAAGTTC TGGGAAAAGT TCGGAGAGTT AAGGAGGCAT TAATAGAGGT GCACGCGGAT
TTGAGGAGCC AGAAGATAAT GGAGCTAAAC AAGGTTGTCA ACGAGATTGT AAGGGGGATT
TACCCCTACA CGGATATCGA GGAAGTTAGA GTGAGGGTTG TCTCTCCCTC GCGAAGAGTT
GGAGGACGAA GCATTTACCA AGTCGAGGTT AAAGTGGGCG GCGAGTGGTA CCCGTACTCC
TCTAGGCTGA GCGACGGTCA AAAAACGGTG GTTTTCCTCT CGTTGCTCAT TGGGCTTAAC
AGGTTGCTTA ACAAAAGGGT AGGCTTCCTA GTGCTCGACG AGCCAGTTCC GAACGTTGAT
GACGCTATCA AAGCATCCCT GCTTAAGTCA ATGTTGCTGG TAACGGGTCT GAGGCAGGCG
ATTGTTACTA CCCAGGCAGA AGACATAGCG GGAAGAGTTG AAGGAGTTTC CCTGGTAAGG
CTTTCCCGGA TGTAG
 
Protein sequence
MEVEGYRFFK DPFRVEFGDG VTVIAGPVGS GKSSLLSAVE YALYGTDSYI ERRVYTKKDL 
VNSASASLRV LLELEVEGEG VYRIERRLSK EGRERVEVLL PDGGILRESS KVLEKVEELL
GMDFFEFSRS VSINYVTLFL LAHGSQSVRS RVLDRLLGID AVEKLARAIT AKPILDEMRS
LKGDVDFLRS AGFSEENLNI LREEERRLEQ ELIEGERLLE GLKKEASDLE GAAKKYQELK
SELNAKEKLL EDLRDRVKGV LSLDALVVGL EELRDKLLRA AEKLLPPSKL IKEIEGIEVS
EYNLREVFSA FERVFHQLDE IYSEKYREIK GLSAQVEVYE KQLREIESRL VGLEEHVSDY
ERAESEIEKI KSEYGDENKL REEIGRLESE LGMLQRRSEL ERCVSSVRRV LAEEVAKKGE
AECYVCGNRL SEEFLDWVRE KVSKSVKELK DVEESIGKLR ERINVLKKKL EDLREYKLTL
INYEAAYEEY QRLLEERKNL QAALDAEREE LERAKSGLSV IGAELKVVRE DFLRLRTSYS
KLPLLDEIKK LEQEVSSLRS ELQRLEPQYN KYLELEKRIE ALSREIEDKR HRLEGIRKDI
DEKRAMLESF EERLARERRL EEVLGKVRRV KEALIEVHAD LRSQKIMELN KVVNEIVRGI
YPYTDIEEVR VRVVSPSRRV GGRSIYQVEV KVGGEWYPYS SRLSDGQKTV VFLSLLIGLN
RLLNKRVGFL VLDEPVPNVD DAIKASLLKS MLLVTGLRQA IVTTQAEDIA GRVEGVSLVR
LSRM
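
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that step, together with a simple GC-content calculation. The helper names `clean_sequence` and `gc_content` are hypothetical, and the sample input is only the first row of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "GTGGAGGTTG AGGGTTACAG GTTCTTTAAA GATCCTTTTC GCGTCGAGTT TGGAGACGGA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Passing the full gene sequence through the same two functions would give a value comparable to the GC content field in the record, assuming that field is computed over the whole coding sequence.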