Gene Arth_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1404 
SymbolispG 
ID4446075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1563524 
End bp1564690 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content64% 
IMG OID639689215 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_830898 
Protein GI116669965 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.502797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCTCGG TCAGCCTGGG AATGCCGTCA GCACCGCCGC CCGTCCTTGC CCCGCGCAGG 
AAGACCCGCC AGATCAAGGT GGGCTCGGTG GGCGTCGGCT CGGATTCGCC CATCAGCGTG
CAGTCGATGA CCACCACGCC CACTACGGAC ATCAACGCCA CGCTGCAGCA AATTGCCGAG
CTGACGGCCT CGGGCTGCGA CATCGTGCGC GTGGCCTGCC CCTCCGCAGA TGATGCGGAA
GCCCTGCCGA TCATCGCGCG GAAGTCCCAG ATCCCCGTCA TCGCGGACAT CCATTTCCAG
CCCAAGTACG TCTTCGCCGC CATCGAGGCG GGTTGTGCTG CGGTGCGCGT CAACCCCGGC
AACATCCGTA AGTTCGATGA CCAGGTCAAG GAGATCGCCC GCGCCGCCAA GGACCACGGC
ACCTCCATCC GGATCGGCGT CAATGCAGGA TCCCTGGAGC CCGGGATCCT GAAGAAGTAC
GGCAAGGCAA CCCCCGAGGC CCTAGTGGAG TCAGCGGTCT GGGAAGCCTC GCTGTTCGAG
GAGCATGGCT TCCACGACTT CAAGATCTCC GTCAAGCACA ACGACCCCGT GATCATGGTC
GCCGCGTACG AGATGCTGGC AGAAAAGGGC GACTGGCCCC TTCACCTCGG CGTTACCGAG
GCCGGACCGG CCTTCCAGGG CACCATCAAG TCCGCCACCG CCTTCGGCGC ACTCCTGTCC
AGGGGCATCG GCGACACCAT CCGCGTTTCC CTCTCGGCCC CTCCCGTCGA GGAAATCAAG
GTTGGCAACC AGATCCTGCA GTCCCTCAAC CTGCGCCCCC GCAAGCTCGA AATTGTCTCT
TGCCCGTCCT GCGGCCGCGC CCAGGTGGAC GTCTACACCC TCGCCGAGCA GGTCACAGCG
GGGCTGGAAG GCATGGAGAT CCCGTTGCGC GTGGCCGTCA TGGGCTGCGT CGTCAACGGC
CCGGGCGAGG CGCGCGAAGC CGACCTCGGC GTGGCCTCCG GTAACGGCAA GGGACAAATC
TTTGTGAAGG GAGAGGTCAT CAAGACTGTC CCTGAGAGCG AAATTGTTGA GACACTGATC
GAAGAGGCCA TGCGAATCGC TGAAGAGATG GGGGAGGCCG ATGGCGAAGA TGCTGTCAAG
GGTAGCCCCG TGGTTAGCGT CTCGTAA
 
Protein sequence
MTSVSLGMPS APPPVLAPRR KTRQIKVGSV GVGSDSPISV QSMTTTPTTD INATLQQIAE 
LTASGCDIVR VACPSADDAE ALPIIARKSQ IPVIADIHFQ PKYVFAAIEA GCAAVRVNPG
NIRKFDDQVK EIARAAKDHG TSIRIGVNAG SLEPGILKKY GKATPEALVE SAVWEASLFE
EHGFHDFKIS VKHNDPVIMV AAYEMLAEKG DWPLHLGVTE AGPAFQGTIK SATAFGALLS
RGIGDTIRVS LSAPPVEEIK VGNQILQSLN LRPRKLEIVS CPSCGRAQVD VYTLAEQVTA
GLEGMEIPLR VAVMGCVVNG PGEAREADLG VASGNGKGQI FVKGEVIKTV PESEIVETLI
EEAMRIAEEM GEADGEDAVK GSPVVSVS