Gene Tpen_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1231 
Symbol 
ID4601163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1168671 
End bp1169714 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content66% 
IMG OID639774007 
Product3-dehydroquinate synthase 
Protein accessionYP_920632 
Protein GI119720137 
COG category[C] Energy production and conversion 
COG ID[COG0371] Glycerol dehydrogenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0263473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGGCTG AGCTCCCGAA GAGGGTTGTA GTCGAGAGGG GAGCCTTGCA GTTTCTACCC 
GAGGTTCTAC GCGAGCTTGG ATGCTCGAAG ACCGTAGTCG TAACGGATAG TGGTGTTTGG
AGCGTTGTGG GGAGCGTCGT CGAGGGTGCT TTAAGGGGGC TGGCCTACGA GGTCGTATAT
ATCGAGGCGG CGGATAACTC GAACGTTGAG AGAGCGCGCT CGGCGGCTAG GAGGGTGGAG
GCTTGCGCCG TGGCTGGGCT GGGGGGCGGG CGGCCCGTCG ACGTCGCGAA GTACGCGGCG
TTCATGGAGG GGCTCCCCTT CGTGAGCGTG CCCACGGCGA TAAGCCACGA CGGCTTCGCC
TCGCCCATAG TGGCGCTCAA GGACCCGGAG GGGAACCCCC TGTCTATATT CACGAGGCCG
CCCGCCGCCG TGCTCGTGGA CCTGGCGGTC GTGTCGAGGG CTCCGAGGAG GCTCCTCGCG
AGCGGGGTCG GGGACATAGT CGGAAAGGTT ACCAGCGTCG CGGACGCCAG GCTTGCCCAG
AGGCTTACAG GTGAGGAGGT CCCGGAGGTA GCCCTCAGGA TGGCGGAGAC GGCGGCCAGG
ATGGTCCTGG ACGAGGTGGA CGAGATAGCT TCGTGGACTG AGAGGGGTGT AGGCGTCCTG
GCGCAGGCCG GGCTACTCGC AGGCATGGCC ATGGCGGTGG CCGGTAGCTC GAGGCCCTGT
AGCGGCTCGG AGCACCTCTT CAGCCACTCC CTGGACAAGT ATGTGCCGTG GAAGAAGAGC
CTCCACGGGG AGCAGGTAGG CGTGGGCGCG ATCATAGCGT CGTACCTCCA CGGGTTCAAC
TGGAGGGTTA TCCGGGACGC CCTCGCGAAG GTCGGGGCGC CGACGACCGT GGAGGGGCTC
GGGGTAACCG GGGAGGACGC GGTCCGCGCC CTCCTCAAGG CCAGGGAGCT GAGAAAGAGG
TTCACGATCC TCGACGTAGT CGAGCTCAAC GAGGGGCTCG CCTGGAAGGT GCTCAGGGAA
ACCGGGGTAG CGCCCACGGC CTAA
 
Protein sequence
MRAELPKRVV VERGALQFLP EVLRELGCSK TVVVTDSGVW SVVGSVVEGA LRGLAYEVVY 
IEAADNSNVE RARSAARRVE ACAVAGLGGG RPVDVAKYAA FMEGLPFVSV PTAISHDGFA
SPIVALKDPE GNPLSIFTRP PAAVLVDLAV VSRAPRRLLA SGVGDIVGKV TSVADARLAQ
RLTGEEVPEV ALRMAETAAR MVLDEVDEIA SWTERGVGVL AQAGLLAGMA MAVAGSSRPC
SGSEHLFSHS LDKYVPWKKS LHGEQVGVGA IIASYLHGFN WRVIRDALAK VGAPTTVEGL
GVTGEDAVRA LLKARELRKR FTILDVVELN EGLAWKVLRE TGVAPTA