Gene Phep_3456 details

Gene Information

Locus tag: Phep_3456
Symbol:
ID: 8254576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4108263
End bp: 4110197
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 44%
IMG OID: 644937108
Product: 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
Protein accession: YP_003093711
Protein GI: 255533339
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
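
The coordinates and lengths listed above are mutually consistent, which a short check can make explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from this record.
length_bp = gene_length(4108263, 4110197)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1935 644
```

Under these assumptions, 4110197 - 4108263 + 1 gives 1935 bp, and 1935 / 3 = 645 codons, of which one is the stop codon, leaving 644 aa.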


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.618512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.044476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATA CAGTATTATA TTCAAGGTTT TTAACACGTG AAGTTGCCAT AGGAGATGTC 
CCAATGGGTG GTTTAAACCC CATCCGTATT CAGTCTATGA CTACTACAGA TACCATGGAT
ACCATAGGCA CAGTGGAGCA GACCATCAGA ATGGTGAATT CCGGTTGCGA ATATGTACGG
ATTACTGCCC CAAGTATGAA AGAGGCAGAA AACCTGGCCA ATATTAAAAA GGAACTGAGG
TTAAGGGGAT ATAACGTGCC CCTCGTAGCT GATATACATT TTACCCCGAA TGCAGCAGAA
GCAGCAGCAC GGATAGTAGA AAAGGTAAGG GTAAACCCAG GTAATTATGC CGATAAAAAA
CGCTTTGAAA ATATTGAATA TACCCATCAG GCTTATCAGG CTGAACTGGA TCGCATCTAT
AAGAAATTTA CACCTTTAGT AAAGATCTGT AAAGAATATG GTACAGCCAT GCGCATTGGC
ACCAACCATG GCTCACTTTC CGACCGCATC ATGAGCCATT ATGGAGATAC TCCCCGTGGC
ATGGTAGAAT CGGCCATGGA GTTCATCCGT ATGTGCGAAG ACCTGAACTA CTACAACCTG
GTCATTTCTA TGAAAGCCAG TAATACCCAG GTAATGGTAC AGGCCTACAG GCTGTTGGTA
GAAACTATGG TAAAAGAAGG TATGAATTAT CCTTTGCACC TGGGAGTTAC GGAGGCCGGT
GACGGTGAAG ACGGACGGAT CAAATCGGCG GTGGGCATTG GTACCTTACT TGAAGATGGT
CTTGGTGATA CGATCAGGGT CTCTTTAACT GAAGATCCTG AATTTGAGGC ACCTGTTGCC
AAAGCACTGG CAATGAGATA TGAAAAGCGT GCCCTTGATC TGTATGGAAA AATGCTGTTC
AGTGAAACAA AAACCATTGT TCCCGGGCAG CTTCCTTACT CTCCTTTCGA ATACCAGAGA
AGAGTTACAG TTCCGGTACA GCACATTGGG GGACATTTTC ACCCGGCGGT AATGCTGGAT
GTTAGCCATG AAAATTTAAA GGATCCTTAC TTTCTGGCTG CTGTGGGTTA TCAATACAGT
GCCGGACTGG ATAAGTACAA TATGGCCGAT CAGGCCTGTG ATATAGTTTA CCTGGGCGAT
CAACTGCCCT CATTTTCTTT TCCCGGAAAC TTAAAGCAGG TATATAACTA CAGTACCTGG
CTTTCCCTAA AGGACAAAAA CAACTGCCAT CCTTTGCTTT CCTATGATGA ATATCTGTCA
GTGGAGCTTC ATGATGAGCA ATTAAACCTG GTTAAAATTA CTGCTGAAAA TGCGCTGCAG
ACCGATCTTT CTGCTTTAAA AGATAAAGTA GTACTCGTAT TGGAAACTTC ATCATTAAAT
GGAGTGGCTG CTCAACGGGC ATTTTTCAAT GCGCTGATAG AAAAAGGGCT GCAGATACCG
GTTATCATTA AACGCAGTTA TACTGCTGTT AATACCGATG ACCTGATGTT ATATGCTGCA
ACGGACATGG GTGCATTGTT AACGGACGGA ATGGGGGACG GGGTGTGGAT TGATGCTGAT
GCTTTGCTGG GTTTGTCTTT AATTAATGCT ACCAGTTTCG GAATTTTGCA GGCCACACGT
ACCCGTATCT CTAAAACAGA ATACATTTCA TGTCCAAGCT GCGGAAGGAC GTTGTTCGAT
TTACAGGAAA CCACGCAGTT GATCCGTTCC CGTACTGACC ATTTAAAGGG TATCAAGATT
GGCATTATGG GCTGTATCGT AAATGGTCCT GGTGAAATGG CAGATGCAGA TTATGGTTAT
GTAGGGACCG GACCCGATAA AATAACATTA TACCGCGGTA AGGAAGTGGT GAAAAAGAAT
GTAAATGCTG CCCGGGCACT GGATGATTTG ATCGATTTGA TCAAGGAGGA TGGAAATTGG
GTTCAAAAAG TGTAA
 
Protein sequence
MEDTVLYSRF LTREVAIGDV PMGGLNPIRI QSMTTTDTMD TIGTVEQTIR MVNSGCEYVR 
ITAPSMKEAE NLANIKKELR LRGYNVPLVA DIHFTPNAAE AAARIVEKVR VNPGNYADKK
RFENIEYTHQ AYQAELDRIY KKFTPLVKIC KEYGTAMRIG TNHGSLSDRI MSHYGDTPRG
MVESAMEFIR MCEDLNYYNL VISMKASNTQ VMVQAYRLLV ETMVKEGMNY PLHLGVTEAG
DGEDGRIKSA VGIGTLLEDG LGDTIRVSLT EDPEFEAPVA KALAMRYEKR ALDLYGKMLF
SETKTIVPGQ LPYSPFEYQR RVTVPVQHIG GHFHPAVMLD VSHENLKDPY FLAAVGYQYS
AGLDKYNMAD QACDIVYLGD QLPSFSFPGN LKQVYNYSTW LSLKDKNNCH PLLSYDEYLS
VELHDEQLNL VKITAENALQ TDLSALKDKV VLVLETSSLN GVAAQRAFFN ALIEKGLQIP
VIIKRSYTAV NTDDLMLYAA TDMGALLTDG MGDGVWIDAD ALLGLSLINA TSFGILQATR
TRISKTEYIS CPSCGRTLFD LQETTQLIRS RTDHLKGIKI GIMGCIVNGP GEMADADYGY
VGTGPDKITL YRGKEVVKKN VNAARALDDL IDLIKEDGNW VQKV
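
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is copied as displayed here, in whitespace-separated blocks of 10 bases. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input only.
first_line = "ATGGAAGATA CAGTATTATA TTCAAGGTTT TTAACACGTG AAGTTGCCAT AGGAGATGTC"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines joined) to gc_percent would give the value to compare against the reported GC content.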