Gene Information |
Locus tag | Tpen_1095 |
Symbol | |
ID | 4600962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1033331 |
End bp | 1034578 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773872 |
Product | hypothetical protein |
Protein accession | YP_920497 |
Protein GI | 119720002 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0776511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCTA TAGGGGTAGC CCTCGGAGCC GTACCCATAA CCCCCTCCCC GCCCGCGGGC CACGAGCTCG CAGGCTACAT AGCCAGGCAG GGGCGCTCGC TCGGAGCCCA CGACGACGTC GAGGCTAGGT GCATGCTTAT AGACTGGCAA CCGGCAGTCC TGCTGGTAAA CCTAGACCTT CTCGGCGTCG ACTCGGGGAT AGTTGAAACC GTTCACAGGG TGGCGGAAAG GGAGGTCGGG GCGGTGGAGG TCGTTGTAAG CGCTACGCAC ACCCACTCCG CGCCAGCAAC ACTCTTCACA AACCCACTGC TGACGTTCGG GGGGAGTTTC CTGCGCCGAG ACTACCTGGC ATACTTCGAA GAGAGGCTCA GAATCCTTTT TCAGGACCTC GCGGGGAGGG TTGAGCGGCA CGATGTACTC GTAGGGAAGG GATCCGTGAG CGGCGTGGTT ACGGACAGGG AGGATCCAGG CAGAAAGGTG GACGACGAGG CACTGGTAGT CGTGTTTAAA AGGGGGCTCC AGACGGCTGG GTACCTGCTT AACTACGCGG TACACCCCAC GGTTCTGGGC CCCGACAACC TGCTGATCTC GAAGGACCTC GTAGACCCCG TCCTCAGGTA CCTGGGCGCG GCGTGGAAAG CGGGGGTAGG CTTGTTCCTG AATGGCGCGG CTGCAAACGT GTCTACCAGG TTTACGAGAA GGGGGCAAAC ATTCGAGGAA GCAGAAAGGC TCGGAAAGCT ACTCGTGGAC CAACTCCTCC TGTTACCTCT TAGAGGCATG CCCGGGGATG CCGCAGAGGT GGATTTACAA GCCAAAAGGG TCAGTGTGTC CTTTAGGCAA CCGTCGCCGG AGGAACTGTC AGCAGCGTTA CGCGTGGGTG CAGGGTCGGC GCACCCCAGG GTGAGGGAAG CTATCCTCGA AGGCGTGAAA GCGCTGGAGA GGATAACCCC CTACCTATCC TCCGCTGGGC GCAGCGAGGT CGAGCTACGT GTTCTCAGGA TAGGGTCCTT GAGGCTTCTC TTCGCGCCCT TCGAGCTCCA CTCCGACTTC TCTCTACGAC TGAAGCGGTT TGCCGGATCC GGTAAGCTGG GGCTCGTAGG CTACTCGGGG GAGTATCTAG GCTACCTCGT CCCAAGGGAC TACGCGCTCG GCTATGAGTC TGTCATGCAG GTTCTCGACG AGGAGAGCTC GGAGAGGGTT TTAAGGGGGT TAGAGGAGGT GTTAGACTTC GATCCACTTC CCGCTTGA
|
Protein sequence | MAAIGVALGA VPITPSPPAG HELAGYIARQ GRSLGAHDDV EARCMLIDWQ PAVLLVNLDL LGVDSGIVET VHRVAEREVG AVEVVVSATH THSAPATLFT NPLLTFGGSF LRRDYLAYFE ERLRILFQDL AGRVERHDVL VGKGSVSGVV TDREDPGRKV DDEALVVVFK RGLQTAGYLL NYAVHPTVLG PDNLLISKDL VDPVLRYLGA AWKAGVGLFL NGAAANVSTR FTRRGQTFEE AERLGKLLVD QLLLLPLRGM PGDAAEVDLQ AKRVSVSFRQ PSPEELSAAL RVGAGSAHPR VREAILEGVK ALERITPYLS SAGRSEVELR VLRIGSLRLL FAPFELHSDF SLRLKRFAGS GKLGLVGYSG EYLGYLVPRD YALGYESVMQ VLDEESSERV LRGLEEVLDF DPLPA
|
| |
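The numeric fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the record above.
    length = gene_length(1033331, 1034578)
    print(length)                  # 1248, matching "Gene Length"
    print(protein_length(length))  # 415, matching "Protein Length"
```

Passing the Gene sequence field to `gc_fraction` as a string should give a value comparable to the GC content field, which the record reports rounded to a whole percent.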