Gene Tneu_1881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1881 
Symbol 
ID6165202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1658623 
End bp1659957 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content63% 
IMG OID641669043 
Productphosphomethylpyrimidine kinase 
Protein accessionYP_001795242 
Protein GI171186323 
COG category[H] Coenzyme transport and metabolism
[S] Function unknown 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG1992] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000178657 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.46671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATAG CCGGTTTAGA CTCAGGCGGC GGCGCGGGGA TACACGCAGA CGTGAAGACC 
TTCGCGGCGA TGGGGGTCCA CGGCACCACG GCGCTGACCT GCGTAACCGC ACAAAACACA
TACGAGGTCA GGGAGGCCCA GTGCCTCCAG CCGCCGCTCG TGAAGGCGCA GATACTGGCG
GTTTGGGACG ACATGGGCAT AGACGCGGGC AAGACCGGAA TGCTGGGGAC AAGAGAAATA
ATCGAGGAGG TGGCCGCCAC GGTCTCCAAG CTGGGCTTCC CCCTGGTGGT GGACCCCGTA
ATGATCGCGA AGTCGGGCGC TCCGCTGATA TCTGAAGACG CCGTAGACAC CCTCAAGAGG
AGGTTGCTCC CAGTGGCGAA GGTAGTCACG CCCAATAGAC ACGAGGCGGA GAAGCTGACC
GGCATAAAGA TAACCAGCGT CGCGGAGGCG AGGAGAGCCG CGGAGGTCAT ACACAGGGAG
TTCGGCACAG AGGTGGTGGT GGTCAAGGGA GGCCACCTAG ACGCCCCGGA GGCCGTCGAC
GTGGTGTACA TAGGCGGCAC CTTCCACGAG CTGGCGACCC CCCGCCTAGA CTCGCGGGCG
ACCCACGGAA CCGGTTGCTC CTACTCCGCC GCCATAGCCG CGGGGCTCGC CAAGGGCCTA
CCCCCCCTGG AGGCCATAAA GACCGCAAAG CGCTTCATCT ACATGGCCAT CAGATACGGG
GTGGCCAGGG GAAAGGGCCA CTGGCCCGTC AACCCCATGG CCTGGCTAGA GATGCCGGCG
GAGAGGTGGA GGACGTTGGA GGAGCTGAGG GAGGCCCTAG AGGCCGTGGA GAGGCAAGCC
GAGGTCTTTG CAAAGGCCAT ACCCGAGGTG CAGACGAACA TAGGATACGC CATAGACCCC
CGCTACGCCA CGACGAGAGA AGACGTCGCC GCGGTGCCCG GGAGGATAGT GAACTACATG
GGCCGCGCCA AGCCCTCCGG CCCGCCCGCC TTCGGCGCAA GCGACCACAT AGCCAGGAAG
ATACTGGCGG CGCTGGCGAA AGACCCCCAG GCCAGGTCGG CCATGAACAT AAGGCTAGAC
GCGGCGTACA TAGAGAAGGC GAAGAGCCTA GGCATGACAA TAGCCTACGT AGACAGGAGG
AGGGAGCCGG AGGACGTCAA GAAGAGAGAG GGAGCCACCA TGCAGTGGAT CATCGAAGAG
GCCTATAGAC AGACAGGGGG AAAAACGCCA GACCTAATAG TGGACTGGGG AGACTGGGGC
AAGGAGCCCA TCATAACCGT ACTGGGGAAA ACCCCAAGGG AGGTGGTAGA GAAGGTCCTT
CGACTCATAC GCTAA
 
Protein sequence
MTIAGLDSGG GAGIHADVKT FAAMGVHGTT ALTCVTAQNT YEVREAQCLQ PPLVKAQILA 
VWDDMGIDAG KTGMLGTREI IEEVAATVSK LGFPLVVDPV MIAKSGAPLI SEDAVDTLKR
RLLPVAKVVT PNRHEAEKLT GIKITSVAEA RRAAEVIHRE FGTEVVVVKG GHLDAPEAVD
VVYIGGTFHE LATPRLDSRA THGTGCSYSA AIAAGLAKGL PPLEAIKTAK RFIYMAIRYG
VARGKGHWPV NPMAWLEMPA ERWRTLEELR EALEAVERQA EVFAKAIPEV QTNIGYAIDP
RYATTREDVA AVPGRIVNYM GRAKPSGPPA FGASDHIARK ILAALAKDPQ ARSAMNIRLD
AAYIEKAKSL GMTIAYVDRR REPEDVKKRE GATMQWIIEE AYRQTGGKTP DLIVDWGDWG
KEPIITVLGK TPREVVEKVL RLIR