Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1685 |
Symbol | |
ID | 4600573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1634106 |
End bp | 1635266 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639774458 |
Product | hypothetical protein |
Protein accession | YP_921083 |
Protein GI | 119720588 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000183285 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGAGTTA GGGCCGAGTT TCTGAGGCTA CTGGAGGAGG ACGTGGAGTT TAGGTATGCA GTCGCAGGCT ACCTGGGAGT GTTAGAGGTT TTGAGGAGGC TAGACTCCCT AGCCGAGGAG ATGAAGAGGC TCAGAGAAGA AATGCACGCG GGCTTCGCGA AGCACGGGGA AATACTTGAA AGGCTCGAAA AGATCCTCGA GAAGCACGAA GAGGCCTTGA GAAGACACGA TGAGGAGCTA ATCAGGCTTA GAGAGGACAT GAATAAGGGG TTTGCCAGGC ACGACGAGCA ACTCGCAAAG CTTAGGGAAG ACATGAACTC CGGCTTCGCG AGGCACGACG AAGCATCGAG AAGGCACGAG GAGCAACTTG TGAAGCTTAG AGAGGACTTC AACAAGCTGA GGGAGGATAT GAACAAGGGC TTCGCTAGAC ACGACGAGGC TCTTAAAAGG CATGAAGAAG AACTCGTGAA GCTTAGGGAG GACATGAACG CCGGCTTTGC TAGACAAGAC AAAGAGCTCA CCAGGCTTAG AGAGGACATG AACGCCGGTT TCGCTAGGCA TGACGAGTTA TTCAGGAGGC ATGAAGAGGA GATGAAGGGA CTACGGGAGG ATATGAATAA GGGATTTGCC AGGCACGATG AAGCGTTGAG GAGGCACGAG GAAGAGCTAG TAAAGCTCAG AGAGGACATG AACAAAGGCT TTGCTAGACA CGACGAGGCG CTGAGGAGGC ATGAGGAGAT TTTAGAGAAG CATAGCGAGG AGTTAGCAAA GTTGAGGAGC GCTATGATTG CCGGTTTTGG GGAGTTAAGC AAGTTTGCGG GTATGACCTT CGAGGAATTT GTTAGGAAGT TCCTGACCGC TTACCTCAGG GAGGCGGGCG AAGTCCCGGA AGGCTCAGAG CTGAGGAGGG AGGTCGTAGA AGGCGAGGAG ATAGACCTCT TCCTCGAAGA ACCACTCATA GTCGGAGAAG TAACGGCACA CGCAGAGTCC CTGGAGGAGC TTGAGAAGCT AGTCAGGAAG GCGGAGCTGG TGAAAGCCAA GTACGGTAAA GAACCCAGGA AAATCCTAGT CATACTTACA GCGCCCAGGG ACTTAGCAGA GAAACTTGAA AAAATTGCCG GCGAGAAGGG CGTAGAGCTC ATAATAGGGA GAACAGCCTA G
|
Protein sequence | MGVRAEFLRL LEEDVEFRYA VAGYLGVLEV LRRLDSLAEE MKRLREEMHA GFAKHGEILE RLEKILEKHE EALRRHDEEL IRLREDMNKG FARHDEQLAK LREDMNSGFA RHDEASRRHE EQLVKLREDF NKLREDMNKG FARHDEALKR HEEELVKLRE DMNAGFARQD KELTRLREDM NAGFARHDEL FRRHEEEMKG LREDMNKGFA RHDEALRRHE EELVKLREDM NKGFARHDEA LRRHEEILEK HSEELAKLRS AMIAGFGELS KFAGMTFEEF VRKFLTAYLR EAGEVPEGSE LRREVVEGEE IDLFLEEPLI VGEVTAHAES LEELEKLVRK AELVKAKYGK EPRKILVILT APRDLAEKLE KIAGEKGVEL IIGRTA
|
| |