Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0623 |
Symbol | |
ID | 4601410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 579491 |
End bp | 580561 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639773397 |
Product | DNA primase, large subunit |
Protein accession | YP_920030 |
Protein GI | 119719535 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2219] Eukaryotic-type DNA primase, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.426261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCCAA GCTCCAGCGC ACACACGATC AGAGTTATAT TAGATACAAA GGACTACGCT AAATACCCTT TCCTGAGAGA GGCTGTAGAA AGCATCAGGG AGACTGGGTT AACGCTTGAA GATCTAGAGA GGTTCGACGC GGTAGACGAC GCGAAAGAGA AAATAAAAAG TGTAATAACG CGCGGCGAAT ACCCTTCCAT CGACGACTAC CAAAACGAAG TCCCTGCTTT TCTAACATGC ATAGTTATTC TCTCCGAGAT CGGCGACAGC GCACTGAGCG AAAGATTCGC CGTAGCATTC TCTAAAAAAG TGAGTGAAAA CCTGAAAGAG GAAGAGAAGG AGGAACGCAT CGAGATAGCT TTCCACATAG CTAGCTCTGT TCTCGGATGG TCCTTCGACG TAGACGAAAA AGGAACGTTG ATCAGGTTGA GATACAAGGA CTACCTCTCT GGGGTCCCAG AGTACACCGG GGAATGGAAG CTCGTGAACA GGTTGCTGGA TAAAGGTTAC GTGGAGGTTC GAAAAGAGAG TTTCCTGAGG CTCATAGAAA CCGGGGTGAA AAAGTACGTG TTAAGGCTTA TTGAGCAAAC TAAGGTCGAT GAGGCAAGGA TACCTCCTAA GCTTTACAGC GCAGTAGAAG AGATAGCCAG CATTTGGTCT AAGCAACGCG AGGAGCTGTT AACGTATGCG AGGAACGCCT CTAAGGGTAA AAGGGAGGAC CTCTTTCCCC CCTGCATTCG CGCGATAATC CAGGACATCT CGGCTGGGAA AAACCTTCCT CACAGCGCCA GGTTTGCGCT TGCATCTTTC CTTCTAAGCG TTGGGTTCTC TGTTGACGAG GTACTGGAGG TGTTCAAGCT TTCGCCGGAC TACCGGGAGG ATATCGCCCG GTACCAAGTG GAACACATAG CCGGTATGCG CGGCTCCAGG ACTAAGTACC TCCCATACAA GTGTGATAAC ATGAGGTCTT ACGGTCTCTG CAGGTGGAGA TGCGAAAACA TAAGGCACCC GTTACAGTTT TTCTTTAAAG CGGCGAGGGG GCGTGCACCG CGCGTCACAG AGCTCAGCTA A
|
Protein sequence | MWPSSSAHTI RVILDTKDYA KYPFLREAVE SIRETGLTLE DLERFDAVDD AKEKIKSVIT RGEYPSIDDY QNEVPAFLTC IVILSEIGDS ALSERFAVAF SKKVSENLKE EEKEERIEIA FHIASSVLGW SFDVDEKGTL IRLRYKDYLS GVPEYTGEWK LVNRLLDKGY VEVRKESFLR LIETGVKKYV LRLIEQTKVD EARIPPKLYS AVEEIASIWS KQREELLTYA RNASKGKRED LFPPCIRAII QDISAGKNLP HSARFALASF LLSVGFSVDE VLEVFKLSPD YREDIARYQV EHIAGMRGSR TKYLPYKCDN MRSYGLCRWR CENIRHPLQF FFKAARGRAP RVTELS
|
| |