Gene Tpen_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0501 
Symbol 
ID4601335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp455316 
End bp456596 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content50% 
IMG OID639773268 
ProductDNA primase 
Protein accessionYP_919911 
Protein GI119719416 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGTT TACCTGTCTC ACCTAAGTAC GTGATAAAGG CTAAGATGGA GATAAAAGGA 
TCCGTCGAAA AACACGACAT AATAGGGGCT ATCTTTGGCC AGGCAGAAGG CTTGCTCGGA
TCAGAGCTCG ACTTGAGGGA GCTTCAAAAG ACCGGCAGAG TTGGACGCAT AGAGGTGAAC
ACCCGATCGC AGGATGGCAC CCTAGTAGCC GAGATAGAGA TACCGACGAA CCTCGACATG
GCTGAGACTG CTATCATCGC CGCTACGATC GAGAGCATTG ACAAGGTGGG TCCTTACCCT
GCAAAGACGG AGGTAGTCTC GATCGAGGAT GTAAGGGCGG AGAAGAGGCA GAAGATAATT
GAGCGGGCGG TTGAGCTTTA CAAAAAGCTC CTGGAGAGCG TGCCGGAGTC CAGGGAGCTC
GTCGAGGAGG TCTTAAGGCG TGTAAGGGTA GCAGAGATAG TGGAGTACGG AGAGGAAAAG
CTGCCAGGAG GCCCCGAAGT TGAAACATCC GATACTGTAA TCCTTGTGGA GGGGCGTGCC
GACGTACAAA ACCTTCTTAG ACATGGATAT AAGAACGTTA TAGCGCTTGG AGGAGCTACC
ATACCTAAGA GCATCAAGAG TTTGGTGGAG AACAAGAAGG TTATACTTTT CGTTGACGGT
GATCGGGGAG GAGAGCTCAT AGCGAGAAAC GTTATCAACG CGCTGAAAGT CGACTTTGTT
GCAAGGGCTC CGCCGGGAAG GGAAGTCGAG GATCTAACTG CAAAGGAGAT AGCGAGGGCG
CTCCAGAATA AGATCCCGGT TGACGAGTTT CTGCAAGCTC TAGAAAGGGA GAAAAAGCAA
CAAAAAGAGG TAAAAGCAGA GATAGTAGTC CCCCCGGCAA AGCTGATAAA GTCTGCTACT
AAAAAGAACC CGGAGATCGC GCAGGAAATC GTAGTACCAG CTGAGGTTTA CGAGAAACTC
GAAGAGCTTA AAGGTACGCT GGAAGCCGTT ATCTACGATG AAAACTGGCA GGTCGTTGAA
AAAGTTCCTG TAAGGGACCT TGTCAACAGG CTTAAGGAGG TTGAACGAGC GGCGCACGTA
GTGCTCGACG GAATTATCAC TCAGCGCTTA GTGGACGTTG CGTATACGAA GGGTCTTAAA
TCACTTATTG GAGTCAGGAT AGGAGAGATA ATAAGGAAGC CAGACAACAT TGTGCTAGCA
ACCTTCAGTT CGGTGAAAAA GAGCGAGGAG AATATCCAGG AAAGTGTAAG CACTGGTGAG
AGCGCTCAGA CGAGCCCCTA G
 
Protein sequence
MGGLPVSPKY VIKAKMEIKG SVEKHDIIGA IFGQAEGLLG SELDLRELQK TGRVGRIEVN 
TRSQDGTLVA EIEIPTNLDM AETAIIAATI ESIDKVGPYP AKTEVVSIED VRAEKRQKII
ERAVELYKKL LESVPESREL VEEVLRRVRV AEIVEYGEEK LPGGPEVETS DTVILVEGRA
DVQNLLRHGY KNVIALGGAT IPKSIKSLVE NKKVILFVDG DRGGELIARN VINALKVDFV
ARAPPGREVE DLTAKEIARA LQNKIPVDEF LQALEREKKQ QKEVKAEIVV PPAKLIKSAT
KKNPEIAQEI VVPAEVYEKL EELKGTLEAV IYDENWQVVE KVPVRDLVNR LKEVERAAHV
VLDGIITQRL VDVAYTKGLK SLIGVRIGEI IRKPDNIVLA TFSSVKKSEE NIQESVSTGE
SAQTSP