Gene Tpet_0414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0414 
Symbol 
ID5171180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp399360 
End bp401159 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content47% 
IMG OID640562913 
Productrecombination factor protein RarA/unknown domain fusion protein 
Protein accessionYP_001244014 
Protein GI148269554 
COG category[L] Replication, recombination and repair
[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1
[COG2256] ATPase related to the helicase subunit of the Holliday junction resolvase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000174137 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTATTT CAGAAAAACC CTTGAATGAA CTGCTCAGGC CGAAAGATTT CGAGGATTTC 
GTCGGTCAGG ATCATATATT CGGCGATAAA GGGATTCTCC GGCGAACTTT AAAAACAGGC
AACATGTTCT CCTCCATCCT TTATGGACCA CCGGGGTCTG GTAAGACCTC GGTTTTTTCA
CTGCTGAAAA GGTATTTCAA CGGCGAGGTA GTTTATCTGA GCTCCACCGT TCACGGGGTT
TCTGAAATAA AGAACGTTCT CAAAAGAGGA GAACAGCTGA GAAAATACGG AAAAAAACTG
CTTCTTTTTC TCGATGAAAT ACACCGCCTG AACAAAAACC AGCAGATGGT CTTGGTTTCC
CATGTTGAAC GAGGAGACAT CGTACTGGTT GCGACAACAA CTGAAAACCC AAGTTTTGCC
ATCGTACCCG CCCTTCTCTC GAGATGCAGG ATTCTTTATT TCAAAAAACT CTCTGACGAG
GATCTGATGA AGATCTTGAA GAAAGCGACA AGGGTTCTCA AACTCGATCT GGAAGAGACA
GCAGAAAAGG CCATAGTAAA GCACTCCGAA GGAGACGCCA GGAAACTCTT GAACACCCTG
GAGATCGTCC ACCAGGCGTT CAAAAACAAG AGGGCAACTC TTGAAGATCT GGAAACTCTG
CTGGGAAATG TGAGCGGGTA CACGAAGGAA TCACACTACG ATTTCGCTTC AGCCTTTATA
AAGAGTATGA GGGGTAGCGA TCCAAACGCC GCCGTTTACT ACCTCGTCAA GATGATAGAG
ATGGGAGAAG ACCCGCGATT CATAGCACGA AGAATGATCA TATTCGCCAG CGAAGACGTT
GGACTCGCCG ACCCAAACGC TCTACATATC GCCGTTTCGA CCTCCATCGC TGTCGAACAC
GTGGGACTTC CCGAATGCTT GATGAACCTT GTAGAGTGTG CCATCTACCT TTCCCTCGCT
CCCAAGAGCA ATTCTGTTTA TCTTGCAATG AAAAAGGCCC AGGAACTTCT TGTGGAGGAC
GTACCGCTTT TTTTGAGAAA TCCCGTCACC GAAGAAATGA AAAAGCGTGG ATACGGGGAA
GGGTATCTTT ATCCTCATGA TTTCGGCGGC TTTGTGAAAA CGAACTATCT TCCAGAAAAG
TTGAAAGGTG AGGTCATCTT CCAGCCAAAA AGAGTAGGTT TTGAGGAAGA ACTCTTTGAA
AGGCTCAAAC GTCTTTGGCC CGAGAAGTAC GGGGGTGAAA GTATGGCCGA AGTGAGAAAA
GAACTGGAAT ACAAAGGGAA AAAGATCAGA ATCGTAAAGG GAGACATCAC AAGAGAAGAA
GCGGACGCCA TAGTGAACGC GGCGAACGAA TATCTGAAAC ACGGAGGAGG AGTAGCGGGA
GCGATCGTGA GAGCGGGTGG AAGCGTTATT CAGGAGGAAA GCGACAGGAT CGTTCAGGAG
CGTGGAAGAA TTCCGACAGG CGAAGCGGTG GTAACCGGCG CTGGAAAGCT GAAGGCAAAA
TACGTGATTC ACGCTGTGGG ACCCGTTTGG AGAGGAGGCA GTCATGGAGA GGACGAACTC
CTCTACAAAG CAGTTTACAA CGCACTCCTT CGAGCTCACG AACTGAAATT GAAAAGCATC
TCAATGCCTG CCATCAGCAC AGGAATCTTC GGATTTCCGA AAGAAAGAGC GGTGGGAATC
TTTTCAAAGG CAATAAAAGA TTTCATCGAT CAACATCCGG ATACCGCTCT GGAAGAAATC
CGCATATGCA ATATAGACGA GGAAACAACG AAAATTTTCG AAGAAAAGTT CAGCGTTTGA
 
Protein sequence
MSISEKPLNE LLRPKDFEDF VGQDHIFGDK GILRRTLKTG NMFSSILYGP PGSGKTSVFS 
LLKRYFNGEV VYLSSTVHGV SEIKNVLKRG EQLRKYGKKL LLFLDEIHRL NKNQQMVLVS
HVERGDIVLV ATTTENPSFA IVPALLSRCR ILYFKKLSDE DLMKILKKAT RVLKLDLEET
AEKAIVKHSE GDARKLLNTL EIVHQAFKNK RATLEDLETL LGNVSGYTKE SHYDFASAFI
KSMRGSDPNA AVYYLVKMIE MGEDPRFIAR RMIIFASEDV GLADPNALHI AVSTSIAVEH
VGLPECLMNL VECAIYLSLA PKSNSVYLAM KKAQELLVED VPLFLRNPVT EEMKKRGYGE
GYLYPHDFGG FVKTNYLPEK LKGEVIFQPK RVGFEEELFE RLKRLWPEKY GGESMAEVRK
ELEYKGKKIR IVKGDITREE ADAIVNAANE YLKHGGGVAG AIVRAGGSVI QEESDRIVQE
RGRIPTGEAV VTGAGKLKAK YVIHAVGPVW RGGSHGEDEL LYKAVYNALL RAHELKLKSI
SMPAISTGIF GFPKERAVGI FSKAIKDFID QHPDTALEEI RICNIDEETT KIFEEKFSV