Gene Information |
Locus tag | Tpen_1202 |
Symbol | |
ID | 4600394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1140481 |
End bp | 1141692 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773978 |
Product | saccharopine dehydrogenase |
Protein accession | YP_920603 |
Protein GI | 119720108 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1748] Saccharopine dehydrogenase and related proteins |
TIGRFAM ID | |
|
|
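The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record for Tpen_1202.
length_bp = feature_length(1140481, 1141692)
print(length_bp)                           # 1212
print(protein_length_from_cds(length_bp))  # 403
```

Both results agree with the Gene Length (1212 bp) and Protein Length (403 aa) rows of the record.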
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAG TTGTTGTCGG TTGCGGGGCT GTCGGCTCCC TCGTGGCTAG GCTCGCGGCC AAGTGGAAGG TTGCGGACGA GGTTCTGTGC TTGGACAAGG ACGTGGAGAG GGCTAAGAGG TACCTCGACT ACCCGGAGCC GCTAGGCATA CCGGTGGAGA GGGCGGACGC CCTCGCCGCC GAGGAGCTAA AGGCGAAGGT AGCGGGCTAC GACTTCTTGG TGAACTCTCT CCCGACGTTC GTAAAGGTCG ACAAGGCTGA AAGGTTGCTC AACCCGCAGC TAATGAGCGT CGCGCTGAAA GCGGGGCTCA ACTACGCGGA CCTCGCCTGC TACGGGGGGA AGAGGAGGAG GGCCGAGCAG CTCTCCTTCT CCAAGGCGTT CAGCGAGGCG GGCCTTCTCG CCCTCATAAA CATGGGGGCC TCCCCCGGCC TTTCCAACAT ACTCGCGAGG GAGGTCTACG AGGATCTCGA CTCGGCGGAG TCTCTCTACG TGATGTCCCT CGAGGACCAG AGGGGGAGCT CGTTCGTGAT TCCGTGGTCG AGGGAGGAGA TGCTCAACGT TGCTTCGCCT GAGCTGTGTT TCCGCGGCAG GAAGTACTCC CTCAGGGAGC CCTTCTCCGA GAGCGCGCTC TGCAACTTCC CGGAGCCCAT AGGCCCCGTT AGGTGCTACT CCGTCTCTAA TGACGAAGCC TACACGATCC CGGCTTTCCT GAGGATCTCG AACTTCTACT ACCTGGCCGG CGGGAGCGAC ATAGAAGTCC TGAGGGCTCT GTACAGGCTC GGCATACTGA GCGACGTGCC CGTGAAGCTA CGCAAGGCGA CGGTAACCCC CAGGGAGCTA CTCTACCACA TCCTGCCCCC GACCCCCTCC CCCGAGTACA TAGTCAGGGT GGTAAAGGAG GGGGACCTCG AGGACGCCTA CTTCGCGCTA CAGGTGTACG CCGAGGGCGA GGTTAGAGGC GAGAGGGCGG TCTCGAAGAG GTACCTCGTC TTCCCATCGC AGAGAAGGGT AAACGAGCTG ATGCCGGGGG CTACCTACAT CACGTACCCC ACGGCTCTGA GCCTCCTAGC CGTGCTCAGC GCGGTTAAGG GGAGAAGGCT TAGGGGGGTC GTACCCGGTG AAGCCCTACC CGGGCCCATT AGGCGCGCGG TACTGGACTA CCTGAGGGTT CAGGGGATAA CTGTAGGCGA GGAGTTCAGG ACCGTTGCCT AG
|
Protein sequence | MKIVVVGCGA VGSLVARLAA KWKVADEVLC LDKDVERAKR YLDYPEPLGI PVERADALAA EELKAKVAGY DFLVNSLPTF VKVDKAERLL NPQLMSVALK AGLNYADLAC YGGKRRRAEQ LSFSKAFSEA GLLALINMGA SPGLSNILAR EVYEDLDSAE SLYVMSLEDQ RGSSFVIPWS REEMLNVASP ELCFRGRKYS LREPFSESAL CNFPEPIGPV RCYSVSNDEA YTIPAFLRIS NFYYLAGGSD IEVLRALYRL GILSDVPVKL RKATVTPREL LYHILPPTPS PEYIVRVVKE GDLEDAYFAL QVYAEGEVRG ERAVSKRYLV FPSQRRVNEL MPGATYITYP TALSLLAVLS AVKGRRLRGV VPGEALPGPI RRAVLDYLRV QGITVGEEFR TVA
|
| |
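The gene sequence is listed in space-separated blocks of ten bases, so it needs to be normalised before any computation. The sketch below shows one way to strip the spacing, compute a GC percentage, and check that a sequence has the shape of a complete CDS (length divisible by three, ending in one of the stop codons TAA, TAG or TGA used by translation table 11). The helper names are hypothetical, and the string used in the demonstration is a short toy sequence, not the full gene listed above.

```python
# Stop codons under NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalise(seq: str) -> str:
    """Remove all whitespace (e.g. the 10-base block spacing) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalise(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    s = normalise(seq)
    return len(s) >= 6 and len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Toy input only; paste the full "Gene sequence" field to check the real record.
toy = "ATGAAGATAG TTGCCTAG"
print(len(normalise(toy)))           # 18
print(looks_like_complete_cds(toy))  # True
print(round(gc_percent(toy), 1))     # 38.9
```

Run on the full gene sequence, the GC percentage can be compared against the GC content row of the record.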