Gene Tneu_1934 details


Gene Information

Locus tag: Tneu_1934
Symbol:
ID: 6164996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1706397
End bp: 1707818
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 59%
IMG OID: 641669097
Product: metal dependent phosphohydrolase
Protein accession: YP_001795295
Protein GI: 171186376
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
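
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1706397
END_BP = 1707818
GENE_LENGTH_BP = 1422
PROTEIN_LENGTH_AA = 473


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the record is internally consistent: 1707818 - 1706397 + 1 = 1422 bp, and 1422 / 3 = 474 codons, one of which is the stop codon, leaving 473 residues.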


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGTATA GAAAAGCCAT TAGGGACCCC ATCCACGGCT TCATAAAGCT GACGGAGGAG 
GAGGTTAAGC TGATCGATGG AGAGCCGCTT ATCCAGAGGC TTAGGTATGT AAAACAGCTG
GGCTTCGTCT ACCTGGTCTA CCCGACTGCC ACCCACACCA GATTCGACCA CAGCCTCGGC
GTGATGCACA TCGCCACCTT AATCGGCGAG AGGATACTTC AGCAGCTGGG GTCGTTAGAC
GAGGAGGCGC TGAGGCATCT GCGGGTCGCC GCCCTGCTCC ACGACATCGG CCACCTCCCC
TTCTCCCACT CCTTCGAGGT TTTGACGAGG GAGCTGTTGC ACATGGCGGC GAAGAGGGGG
TGCGTCGACG TGGATCTAGC TCTGTTCGAC GTGGGGAAGC CGCATGAGGT GACGACGAGA
CTTCTCCTGG AGAAGCTGGG GCCGAGGCTG TCGGAGCTTG GGTACAGCCC GGAGCTCGTC
AAGGAGCTCC TCTTCGAGGG GGGCGGGAGG TATGCGCCTC TCGGCGTTAT ACTATCCGGC
GTGTTAGACG CAGACAGGCT GGACTACATA ATGCGCGACA TGTATTTCAC GGGGGCCGCG
GTGGGCACCA GCTTCACCCA CATAGACCTG GAGAGGGTCA TCGAAAACCT GGAGGTGGTC
GGCGGGAGGC TCCAGTTCAA CGAGAAGGCC AGGGTGAACC TGGAGGGGTA CCTCATAACT
AGGTACAACC TATACCGGCA CGTCTATCTA CACCACAAGA CCGTGCTGTT TACAGAGATC
GCGCGGAACA TCCTCGCGGA CAACATAGCG AGGTGTTCAG ACGGCGGGGG AGACGGCGCT
GTGTGCCAGT ACCTGTGCGA TCTGGCTAAG TTCGTCGCAG GTCAGGCAGA GGGCGAGGTG
GTGTGGAAGG CCACGGACGA CTACTTCGTC TCGGTCTTCT CCAGAGACCC CAGGTTCAGG
GATCTTCTCT CTAGGAGGCC GCCGCAGTAC GTCGCTTTGT GGAAGAGGGA GAAGGACTTC
ATAGAGGTCT TCAGGGACCC GGCCAGGATC AACGAGGTGG TGGATAGGCT GGGGCCTCTC
CACTGGGCTC TCGTGGATAG GCTGAGGAAG AAGCTAGTGG AGAGGCTTAA CGTGGAGCTG
GGGGGCGGAT GTGGCCTGTC GTATGAAGAC GTCATAATCT CCTACGTTAG CTTTGATCCC
TACGCCGAAG ATTTGTACAT ATCCACCGCC ATGGGCCCCA TAGCGATTTC GAAGATCTCT
CCGCTTGTGG AGGCGGTAAA CGAGGCTTGG AGGAGGGCGC CTCACGTGTT CCTCTACGTG
AGGAAGGAGG CGCTGGAGAG GTGCGGCGGC GAGGCGCTGG AGAGCCTCAA ATCGTTTCTG
GAGCCTCTGA TGGAGTTGGC GGTGAGGAGG ATCTCAGCTT AG
 
Protein sequence
MQYRKAIRDP IHGFIKLTEE EVKLIDGEPL IQRLRYVKQL GFVYLVYPTA THTRFDHSLG 
VMHIATLIGE RILQQLGSLD EEALRHLRVA ALLHDIGHLP FSHSFEVLTR ELLHMAAKRG
CVDVDLALFD VGKPHEVTTR LLLEKLGPRL SELGYSPELV KELLFEGGGR YAPLGVILSG
VLDADRLDYI MRDMYFTGAA VGTSFTHIDL ERVIENLEVV GGRLQFNEKA RVNLEGYLIT
RYNLYRHVYL HHKTVLFTEI ARNILADNIA RCSDGGGDGA VCQYLCDLAK FVAGQAEGEV
VWKATDDYFV SVFSRDPRFR DLLSRRPPQY VALWKREKDF IEVFRDPARI NEVVDRLGPL
HWALVDRLRK KLVERLNVEL GGGCGLSYED VIISYVSFDP YAEDLYISTA MGPIAISKIS
PLVEAVNEAW RRAPHVFLYV RKEALERCGG EALESLKSFL EPLMELAVRR ISA
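
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is an accepted start codon that is read as methionine at the first position. The sketch below is a minimal, assumption-laden illustration of that translation and of a GC-content calculation. It is not the pipeline that produced this record, and the names `translate_cds` and `gc_percent` are hypothetical. It accepts sequences in the space-separated 10-base blocks used above.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G; third base fastest).
# Table 11 assigns the same amino acids as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove whitespace (such as 10-base block separators) and upper-case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete or partial CDS; the start codon is read as M."""
    seq = clean(dna)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGCAGTATA GAAAAGCCAT TAGGGACCCC"))  # MQYRKAIRDP
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the result to be compared with the protein sequence and the GC content field of this record.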