Gene Tpen_1202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1202 
Symbol 
ID4600394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1140481 
End bp1141692 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content62% 
IMG OID639773978 
Productsaccharopine dehydrogenase 
Protein accessionYP_920603 
Protein GI119720108 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1748] Saccharopine dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATAG TTGTTGTCGG TTGCGGGGCT GTCGGCTCCC TCGTGGCTAG GCTCGCGGCC 
AAGTGGAAGG TTGCGGACGA GGTTCTGTGC TTGGACAAGG ACGTGGAGAG GGCTAAGAGG
TACCTCGACT ACCCGGAGCC GCTAGGCATA CCGGTGGAGA GGGCGGACGC CCTCGCCGCC
GAGGAGCTAA AGGCGAAGGT AGCGGGCTAC GACTTCTTGG TGAACTCTCT CCCGACGTTC
GTAAAGGTCG ACAAGGCTGA AAGGTTGCTC AACCCGCAGC TAATGAGCGT CGCGCTGAAA
GCGGGGCTCA ACTACGCGGA CCTCGCCTGC TACGGGGGGA AGAGGAGGAG GGCCGAGCAG
CTCTCCTTCT CCAAGGCGTT CAGCGAGGCG GGCCTTCTCG CCCTCATAAA CATGGGGGCC
TCCCCCGGCC TTTCCAACAT ACTCGCGAGG GAGGTCTACG AGGATCTCGA CTCGGCGGAG
TCTCTCTACG TGATGTCCCT CGAGGACCAG AGGGGGAGCT CGTTCGTGAT TCCGTGGTCG
AGGGAGGAGA TGCTCAACGT TGCTTCGCCT GAGCTGTGTT TCCGCGGCAG GAAGTACTCC
CTCAGGGAGC CCTTCTCCGA GAGCGCGCTC TGCAACTTCC CGGAGCCCAT AGGCCCCGTT
AGGTGCTACT CCGTCTCTAA TGACGAAGCC TACACGATCC CGGCTTTCCT GAGGATCTCG
AACTTCTACT ACCTGGCCGG CGGGAGCGAC ATAGAAGTCC TGAGGGCTCT GTACAGGCTC
GGCATACTGA GCGACGTGCC CGTGAAGCTA CGCAAGGCGA CGGTAACCCC CAGGGAGCTA
CTCTACCACA TCCTGCCCCC GACCCCCTCC CCCGAGTACA TAGTCAGGGT GGTAAAGGAG
GGGGACCTCG AGGACGCCTA CTTCGCGCTA CAGGTGTACG CCGAGGGCGA GGTTAGAGGC
GAGAGGGCGG TCTCGAAGAG GTACCTCGTC TTCCCATCGC AGAGAAGGGT AAACGAGCTG
ATGCCGGGGG CTACCTACAT CACGTACCCC ACGGCTCTGA GCCTCCTAGC CGTGCTCAGC
GCGGTTAAGG GGAGAAGGCT TAGGGGGGTC GTACCCGGTG AAGCCCTACC CGGGCCCATT
AGGCGCGCGG TACTGGACTA CCTGAGGGTT CAGGGGATAA CTGTAGGCGA GGAGTTCAGG
ACCGTTGCCT AG
 
Protein sequence
MKIVVVGCGA VGSLVARLAA KWKVADEVLC LDKDVERAKR YLDYPEPLGI PVERADALAA 
EELKAKVAGY DFLVNSLPTF VKVDKAERLL NPQLMSVALK AGLNYADLAC YGGKRRRAEQ
LSFSKAFSEA GLLALINMGA SPGLSNILAR EVYEDLDSAE SLYVMSLEDQ RGSSFVIPWS
REEMLNVASP ELCFRGRKYS LREPFSESAL CNFPEPIGPV RCYSVSNDEA YTIPAFLRIS
NFYYLAGGSD IEVLRALYRL GILSDVPVKL RKATVTPREL LYHILPPTPS PEYIVRVVKE
GDLEDAYFAL QVYAEGEVRG ERAVSKRYLV FPSQRRVNEL MPGATYITYP TALSLLAVLS
AVKGRRLRGV VPGEALPGPI RRAVLDYLRV QGITVGEEFR TVA