Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0783 |
Symbol | |
ID | 4601131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 733469 |
End bp | 734596 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773559 |
Product | monooxygenase, FAD-binding |
Protein accession | YP_920188 |
Protein GI | 119719693 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACGTAC TCGTACTGGG CGGGGGCCCT GCCGGTTTGC AGCTGGCCAG GTTCCTCAAG GGCTACGGCG ACGGCGATGT ATACGTCTAC GAGGAGCACG AAAGGGTCGG GCTACCCCAG CACTGCACCG GGCTTGTCAG CATAGAGGGA CTTAGGAAGT GGATAGGTGT CGGCGAGCGG GGGCTCGTCC TAAACAGGTT TAGGGGCGCG CGCTTCGTCT CCCCCTCCGG GAAAGTTTTC CTGGCAAGGC GGGGCTCCGA GGTTGCTGCT ATAATCGAGA GGAAGCTTTT AGAGGAAAAG CTTTACGAGG AGGCAGTATC GGCCGGTGCC CGGGTACTTC TAGGCGCGCG GCAAACGCTG GGAGGCTTCG CGCTTTCCGT GAGGAAGAGG GGGGCGATAG GGGTTATCGC CGGCGGTACG GGCTTCTTGT CGGTACTGCA CGGTGAGAAA AGAGAGTTCC TGCTACCGGC GCTGCAGCTC GATGTACGCG TGGAGGACGA GGTTGGGGAT ACAGACCATC TCTACGTGTT TCTGGGCGAG AAGTTCTCCC GAGGGCTATT CGCGTGGGCA ATCCCGCTCG AGGACGGAAC CTACAGGGTA GGCCTTGCTA GTAGGGGCAA CGTCCTCCTA AGGCTGAAGT ACTTGATGCT CGCGCTGTCG CGTGTAGGCG TGAAGGTGGT GAAAAGGCTG AGAGTGTTCG GCGGCGCCGT CTATACGGGC GGAATGGTCG ACGTCTACGC CGGGGACAGG CTTTTCTTAC TGGGAGACTC GGCTGGGCAG ACGAAGCCCA CGACTGGGGG TGGTCTCGTG TACCTTTCGA TCGCGGCGCG TGCACTGTCG GATGCGATAC TGAGTGATAG ACCGGAGGCT TACGGAGAGG CTGTTAAGCG GGCCTTGGGC AGGGAGATGC ACGTACAGTT GCTCGTTAGG AAAGCCTTGA ACTCTCTTTC GGACGCAGAG CTGGACGAGC TCTTCCAGGC GCTGAAAGAA GTGGGAGGAG AAGAGATCGT GGCCAGCGAG GGCTCCATGG ACGTTCAATC GGCGGTGGCG CTGAAGCTCT CCGCGAAACT TTTCCTCTCG CGCCCAACCC TCCTCGCAAG CGCAGCTCTA AAGTCGCTGG CCTTCTAG
|
Protein sequence | MDVLVLGGGP AGLQLARFLK GYGDGDVYVY EEHERVGLPQ HCTGLVSIEG LRKWIGVGER GLVLNRFRGA RFVSPSGKVF LARRGSEVAA IIERKLLEEK LYEEAVSAGA RVLLGARQTL GGFALSVRKR GAIGVIAGGT GFLSVLHGEK REFLLPALQL DVRVEDEVGD TDHLYVFLGE KFSRGLFAWA IPLEDGTYRV GLASRGNVLL RLKYLMLALS RVGVKVVKRL RVFGGAVYTG GMVDVYAGDR LFLLGDSAGQ TKPTTGGGLV YLSIAARALS DAILSDRPEA YGEAVKRALG REMHVQLLVR KALNSLSDAE LDELFQALKE VGGEEIVASE GSMDVQSAVA LKLSAKLFLS RPTLLASAAL KSLAF
|
| |