Gene Tneu_0808 details


Gene Information

Locus tag: Tneu_0808
Symbol:
ID: 6164298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 725536
End bp: 726423
Gene Length: 888 bp
Protein Length: 295 aa
Translation table: 11
GC content: 67%
IMG OID: 641667966
Product: 3-dehydroquinate dehydratase, type I
Protein accession: YP_001794193
Protein GI: 171185274
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0710] 3-dehydroquinate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01093] 3-dehydroquinate dehydratase, type I
            [TIGR01808] monofunctional chorismate mutase, high GC gram positive type
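
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions about the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 725536
end_bp = 726423
protein_length_aa = 295

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bp holds N // 3 codons; a non-zero remainder would signal
# a length that is not a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 888 0 295
```

Under these assumptions the computed values agree with the listed gene length of 888 bp and protein length of 295 aa.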


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.716383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATATGCG GCGCGGTACC GGTTAGAAAG CCGGCGGACG TATATAGAGC TCTGGACTCC 
CCCGCCCCCT GCCTCGAGCT AAGGCTCGAC TATCTGGAGA GCTCCCTCGC CGAGGCGAAG
CCCGCGTTGG AGGAGGCGGT CGCGAGGAGG ACAGTCATAT TAACAGTTAG GAGGAGGGAG
GAGGGGGGCG CCTGGCGGGG CACCGAGGAG GAGAGGGCGG CCCTCTACCT AAAGCTCCTG
GAGCTGACGC CCCACTTCGT AGACGTGGAG GCCGCCGCGC CGGCGGCTGG GCAGGTGGCG
GCCGCCAGGG GGAGGACTAA GCTTATAGCC AGCAGACACG ACTTCGGCGG GACCCCCCCG
TATGAAACCC TCCTCTCCTG GGCCCGGGAG GCGGCGGCCT TGGGCGACGT GGTGAAGATA
GTCACCTACG CCAGAGAGCC CCGGGACGGC CTCGCCGTTC TCTCCCTAAT CGGCGCCGTG
GAGAAGCCGA CGGTGGCCTT CGCCATGGGG CCGGCCGGGG CCTACACCAG GCTGGCGGCG
GCGGCCCTGG GGAGCCCCAT CATGTACGTA TCGCTGGGCG AGGCGACGGC GCCTGGCCAG
ATATCCCTAG ACGCCTACTA CGCCGCGCTC CTGGGCATGG GGGCCGCCCC CGGGGGTGAG
GGTCTGCCGG CGCTGAGGGA GGCGCTGGAC TGGATAGACG GCGCCCTCAT GCACCTTCTC
AAGAGGAGGC TGGAGGTGTG CCGCGACATG GGGAAGATAA AGAAGTCCGC CGGTCTCCCC
ATCTACGACG ACATCAGAGA GGCCCAGGTC TTGAAGAGGG CGGGCGACTT TAAACAGATC
TTCGAGCTGG TGGTGCAGAT GTGCAAGGCG GTACAACTAG TCGCCTAG
 
Protein sequence
MICGAVPVRK PADVYRALDS PAPCLELRLD YLESSLAEAK PALEEAVARR TVILTVRRRE 
EGGAWRGTEE ERAALYLKLL ELTPHFVDVE AAAPAAGQVA AARGRTKLIA SRHDFGGTPP
YETLLSWARE AAALGDVVKI VTYAREPRDG LAVLSLIGAV EKPTVAFAMG PAGAYTRLAA
AALGSPIMYV SLGEATAPGQ ISLDAYYAAL LGMGAAPGGE GLPALREALD WIDGALMHLL
KRRLEVCRDM GKIKKSAGLP IYDDIREAQV LKRAGDFKQI FELVVQMCKA VQLVA
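
The record lists translation table 11, and the gene sequence begins with TTG while the protein begins with M. A minimal sketch of how the two relate is shown below, using only the first 60 bp of the gene sequence. The `translate` helper, the compact codon-table construction, and the set of start codons are illustrative assumptions written for this example, not part of the record; the sketch relies on table 11 sharing the standard codon assignments and reading an alternative start codon as methionine at the first position.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), listed in
# TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: codons treated as initiators under table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bp, 20 codons).
first_60_bp = "TTGATATGCG GCGCGGTACC GGTTAGAAAG CCGGCGGACG TATATAGAGC TCTGGACTCC"
print(translate(first_60_bp))  # MICGAVPVRKPADVYRALDS
```

The output matches the first 20 residues of the protein sequence listed above.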