Gene Arth_2392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2392 
Symbol 
ID4444975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2679295 
End bp2680890 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content68% 
IMG OID639690202 
Productmultifunctional hydroxymethylpyrimidine phosphokinase/4-amino-5-aminomethyl-2-methylpyrimidine hydrolase 
Protein accessionYP_831871 
Protein GI116670938 
COG category[H] Coenzyme transport and metabolism
[K] Transcription 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0819] Putative transcription activator 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0171536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCCTT TGGCCTCATC ATTCCTGCCC ACCCCTGCCT CCGAAGTCGC CCGCGAATCA 
TCACCGGGCG GCACTGTCCT GCGCAGTACG CCCCGGGTCC TGGCCATCGC CGGCTCCGAT
CCGTCCGGCG GGGCGGGGAT CCAGGCCGAC CTCAAAAGCA TTGCCGCAAA CGGCGGATAC
GGCATGGCTG CCATCACCGC CCTGACGGCG CAGAACACCC GTGGCGTTCG TGCCGTGCAC
GTGCCCCCGG CGGATTTCCT GACGGCCCAG CTTGAAGCCA TCAGCGACGA CATCAGCATT
GACGCCGTCA AGATCGGCAT GCTGGGTGAC TCTTCGGTGA TCGCCGCTGT CCGCAGCTGG
CTGGAGAAGG CCCGTCCCGC CGTCGTGGTT CTTGACCCCG TGATGGTCGC CACCAGCGGG
GACAGGCTCC TGCAGGAGGC GGCCGAGGCG GCACTGCGCG AACTCCTGCC CCTCGCCGAC
CTCGTCACTC CCAACCTGGC GGAACTGGCG ATGCTCCTCA ACGAACCGCT CGCGGACGAC
TGGGAGGCGG CACTCGCCCA GGGGAAGCGC CTCGCCGCCC GGACCGGCGC CACTGTGCTC
GTCAAGGGCG GACACCTCGA CGGCGGGGAG TGCCCTGACG CGCTGGTCAA CACGGCAGGG
CTGCTCGCCC AGGACGTTGT GGTTGTACCC GGCGAGCGGA TCGATACCAT GAACAGCCAC
GGCACCGGCT GCTCCCTGTC CTCGGCAATG GCCACCGCGC AGGCGAGGCT GGGGGACTGG
GAGGAATCCT TGCGGACAGT GAAGCCATGG TTGCAGGGGG CGCTCCGGGA AGCCGGCGCC
TTGGACGTGG GAACAGGCAA CGGCCCGGTG CACCATTTCC ACCACCTGGC CCCCAAAGGA
AGTGATGCGC CCCCCGAAGG CCGGTTCGCA GCGGTGCTCT GGCAAGATGC CGGGCCGGAC
CTGGACGCCG TCTACGAGCT CGACTTCATC CGCGGCCTGG CCGACGGCTC CCTCACCGAG
CAGCACTTCG CCTATTACCT TGCCCAGGAT GCCATCTACC TGAACGGCTA TTCCCGGGTA
CTTTCGCGCG CCGCCGCCAT TGCCCCGACC GAGGTGGAAC AGCTGTTCTG GGCGCGGTCG
GCACAGCAAT GCCTTGAAGT CGAGTCCGAA CTGCACCGGA CATGGCTCAG CACACGGAAC
GTGGACACCG CACTCGGACC GGTTACGAAG TCCTACGTGG ACCACTTGCT GGCCTCATCC
GTTTCAGGCA GCTACGGGGT ACTCGTCGCC GCTGTGCTCC CATGCTTCTG GCTGTATGCA
GAGGTGGGTG CCACCCTGCA CGGGCAGTTC CTTGCTGCCG GGTCGGCCCC GGACCACCCG
TACGCCGAAT GGCTCCGCAC CTACGCGGAC GAAGGGTTTG CCGCCGCCAC CCGGCAGGCG
GTGCGCATTG CCGACGACGC TGCCCGTGCC GCGTCTGACG CGGAGCGGCA AGCCATGCGG
GTGGCCTTCC GGCAGTCGTG CCGGTACGAG GTGGAATTCT TCGACGCGCC GAGGCTTCAC
GCTGCACCGC AAAGCATTCC CGAGCCGGTA CGATAG
 
Protein sequence
MSPLASSFLP TPASEVARES SPGGTVLRST PRVLAIAGSD PSGGAGIQAD LKSIAANGGY 
GMAAITALTA QNTRGVRAVH VPPADFLTAQ LEAISDDISI DAVKIGMLGD SSVIAAVRSW
LEKARPAVVV LDPVMVATSG DRLLQEAAEA ALRELLPLAD LVTPNLAELA MLLNEPLADD
WEAALAQGKR LAARTGATVL VKGGHLDGGE CPDALVNTAG LLAQDVVVVP GERIDTMNSH
GTGCSLSSAM ATAQARLGDW EESLRTVKPW LQGALREAGA LDVGTGNGPV HHFHHLAPKG
SDAPPEGRFA AVLWQDAGPD LDAVYELDFI RGLADGSLTE QHFAYYLAQD AIYLNGYSRV
LSRAAAIAPT EVEQLFWARS AQQCLEVESE LHRTWLSTRN VDTALGPVTK SYVDHLLASS
VSGSYGVLVA AVLPCFWLYA EVGATLHGQF LAAGSAPDHP YAEWLRTYAD EGFAAATRQA
VRIADDAARA ASDAERQAMR VAFRQSCRYE VEFFDAPRLH AAPQSIPEPV R