Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0418 |
Symbol | |
ID | 4600496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 380439 |
End bp | 381863 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773183 |
Product | hypothetical protein |
Protein accession | YP_919830 |
Protein GI | 119719335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTGTG AAGAGAGGCT TGAAGAAGAG GTTAGGGAGG AGACGTGGAG GAGGGCGGCG CTAAGCCTCT ACTCGACCGA GGTCGAGGAT AGGAGGAGGT CTAGGGGCGG GAAGAAGGGC AGGGTACACT ATCGCGGCCT CTACGACGCA GTGTCTAGGA TTAACTGGGA CTTTACGCGC TTCGCGGCGC ACGCGCTGAG CGTCGTGCCG GACGAGGCAT ACTCCAGGTT CGGCAGGCTG TTTGACATCG ACGCGAGGAA GTACCTCCTG CTGAACGACG ACGAGAAGCC GAGGGAGGGA GGCGCAGTTG TGGAGCTACG CGATAGGCTA CAGGCAATCG TCGACGCTAC CGAAGACGGC CTCAAAGTTG AGAGACACGG AAAGCACTGG CAAGTATACA TACCTGGAGA AAACTGGCAC GTCATCGCTT ACAAGCCTAC ACATAACTGG ATCATACTCA TCCCGTTGAA AGGCTTCTGG GTTGAGTCAG AGTTTCCCAG GGTTCTAGTG AATACTCCGA TCAGCGTTTT GCAGAGCCTG CAGAAAGGGT GGGTCTTAAC GGATGTGACA CCCCCTCATG GGCGCTACAG CTACGTACAA TTCAGCACCA CTCAACCGTG GCAGTTGCCA GCCACGCTCG CGGCCTTTCC TAGTGACAAT ATCTGCTTGG GCGTTACGGC GGGCGCACTC GGCAGTACCA AGCTAAGCAT CGAGTGGAAG GTGCACGTCT ATAGCTACGA GGAGGAGCTG GGCTGGGCCT CCGAACTCAT CGGCGAGGTT AAACGTGCAG AGTTCCGCCA GCTTATCGAT GCTTGCAGGG TGCTACAGGG AGACCCGGTC GCTCTACACA CAGCCTTCCT GGGGGACGGC TACCTCGCCT TCTTCCTGAG GCTTCGGATG CTCTTCTTCA GTATTGGACA CGAGATTTTC TACCTCCCAG CTGAGAGCGC TATAGTCAAC GCTAGACTCG CCGTCGAGCT AGCACCAGAG TACACAAAGT TCGTCTCATT AGTGACGAAG TGTCCAAAGA TTAAACACTT CTTGTTCGTC GGCTTCGGAT TGCCGCAAAA GAGAGGTATG AGAGACGGTC AGAGAAAGAC CCCGTTCTAC GCCGAGATAG CGGGGGCTAG GCTACACCTA GTCTACATTT CTACGAGGAA TCATGTCTAC GCGAGGATCG CGGTCGACGA TGCGCCTCAA GGCTGGGTGG AGGAGGCGCG CGCTCAGGGC TGGGACGTCC GGGTGGTTAA CATGGGAAGC GAGGAGTACT ACCAGGTTAC ACATGTCTCT TTGATGGAGC ACGCGCGCTA CGACGAAGCA CTGCGGGAAA CACTCCTCGC CTTCGCGAAG GCGAAAGCCG AGCAGTACCC CAAAGCCTGG GAACTCGTAG AGCGCCTCGA AAAGCTGGGG ACAGGGCAGG AATAG
|
Protein sequence | MGCEERLEEE VREETWRRAA LSLYSTEVED RRRSRGGKKG RVHYRGLYDA VSRINWDFTR FAAHALSVVP DEAYSRFGRL FDIDARKYLL LNDDEKPREG GAVVELRDRL QAIVDATEDG LKVERHGKHW QVYIPGENWH VIAYKPTHNW IILIPLKGFW VESEFPRVLV NTPISVLQSL QKGWVLTDVT PPHGRYSYVQ FSTTQPWQLP ATLAAFPSDN ICLGVTAGAL GSTKLSIEWK VHVYSYEEEL GWASELIGEV KRAEFRQLID ACRVLQGDPV ALHTAFLGDG YLAFFLRLRM LFFSIGHEIF YLPAESAIVN ARLAVELAPE YTKFVSLVTK CPKIKHFLFV GFGLPQKRGM RDGQRKTPFY AEIAGARLHL VYISTRNHVY ARIAVDDAPQ GWVEEARAQG WDVRVVNMGS EEYYQVTHVS LMEHARYDEA LRETLLAFAK AKAEQYPKAW ELVERLEKLG TGQE
|
| |